Gene Franean1_3677 details


Gene Information

Locus tag: Franean1_3677
Symbol:
ID: 5672043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4353396
End bp: 4355135
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 73%
IMG OID: 641242560
Product: glycosyl transferase family protein
Protein accession: YP_001507980
Protein GI: 158315472
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
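
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4353396, 4355135)
print(length)                  # 1740
print(protein_length(length))  # 579
```

Both results match the Gene Length (1740 bp) and Protein Length (579 aa) fields of this record.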


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.846949
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTACC AGGCAGCGCC CGCCGACGCG GCCGGCCCGG CTTCACCCGA GTTCGCGCCA 
CCGTCAGCAC CGCCCAGCCT GGCTCCGCGT GAGCCGTCCG CGGTCGCGGC CCCGGCCGTC
CCGGCGATCC CGGCGCAGGC CGTGGCCGGC GACATCCGCG ACGCCGCCCG GGCCGGGATC
CCGCCGCTGG ACGGCGTCGA CACCTGGGTC CGGGTGCCCG TCGCGTGGTG GAGGGCCGTC
GCCGGACGGG TCGTCGGGGT CGCGATCACG CTGATCGGCG CGGTGTACGC GGTGTGGCGC
GCCGGGACGC TGGACGGCAC CGGCGTGGCC GGCCACCTGT TCTACGCGGC CGAGATCGTC
AGCTACCTCA CGATCGTGTG GACGGCGGTG ATGACCGGCC GGATGCGCAC CGGCCACGTC
CGGCGCGCCC CGGCGCCAGC CGGCACCCTC GACGTCTTCG TCACCGTCTG CGGCGAGCCG
GTCGAGATGG TCGAGGCGAC GCTGCGCGCG GCGCTGGCGA TCGACTACCC GCACCGCACC
TACGTCCTCA ACGACGGGCG GATCGCCGGC CGGCCCAACT GGCGCGACAT CGACGCCCTC
GCCGCTCGGC TCGGGATCAT CTGCTTCACC CGCACCGACG GGCCTCGCGG CAAGGCCGCG
AACCTCAACC ACGGCCTGGC CCGCACCGAC GGCGACGCGA TCATGACGCT GGACGCCGAC
CACATCGCGG TGCCCGATCT CGGCGAGCTG GTCCTCGGCT ACCTGCGGGA CCCGAAGGTC
GGGTTCGTCT GCACCGAGCA GCGCTTCGAC GTCGGCCGCC ATGACGTGCT CAACAACGCC
GAACCGATGC TGTACAAGGC GGTGCAGCCG GCGAAGGACC GCGACAGCGC CGCGTCGTCC
TGCGGGAACG GCACCCTGTA CCGGCGGACG GCGGTTGAGT CCGTCGGCGG CTTCAGTGAG
TGGAACATCG TCGAGGACCT GCACACCTCC TACCAGCTGC ACGCCGCCGG CTGGCAGAGC
GTCTACCACC ACGGCCCGGT GTCCGTCGGG ATCGCGCCGG CCACCGCGGC GGAGTACGCC
AAGCAGCGCA GCCGGTGGGC GATGGACGGC CTGCGCCTGC TGCTGTTCGA CAACCCGCTG
CGCAAGCCCG GCCTGACCGG CTGGCAGCGG GCGCACTACC TGCATACCGG GATCGGCTAC
CTGGTGGCGT GCGCGCAGAT GATGTTCCTG CTGGGGCCGC CGCTGAGCGT GCTGGCCGGG
GTCCAGATCG CGGCCGGGGT GTCGCTGACC GCCTACGTGC TGCACGCCCT GCCGTACCTG
GTCGGCTCGC TGCTGTTCAT CGTCGCCTAC ACCGGGCCGC GCGGAGCCCA GCGGACGGTG
GCCAGCACCC TGTTCAACGC TCCGCTGTAC GCGCTGTCGT TCGTGCGGGT CGTGCTCTCC
GGCCGACCCG ACTCCGGCGC GACCGCGAAG ACCGCGCTGC CGCGGATGTC GCTCCTGCTG
CTGCCCCAGG TGCTTTTCGC CGCCAGTCTG GTGGTCACCA TTCTCGTCGT CGGCGTCAGC
CCGGACGTGG CCGACCTGTC CGCGCTGGTG TGGGCCGGGG TGCTGCTGTC GATGGTGGCC
GGGCCGCTGT CGGCGCTCTC GGAACGCCAG GACCGGGTGG AGCGGGCCCA GCTGCCGATC
CGGGCCGTCA TCCTCGGACT GGTCCTGAGC TTCGCGGTCG TCACCCTCCT GGAGGGCTGA
 
Protein sequence
MAYQAAPADA AGPASPEFAP PSAPPSLAPR EPSAVAAPAV PAIPAQAVAG DIRDAARAGI 
PPLDGVDTWV RVPVAWWRAV AGRVVGVAIT LIGAVYAVWR AGTLDGTGVA GHLFYAAEIV
SYLTIVWTAV MTGRMRTGHV RRAPAPAGTL DVFVTVCGEP VEMVEATLRA ALAIDYPHRT
YVLNDGRIAG RPNWRDIDAL AARLGIICFT RTDGPRGKAA NLNHGLARTD GDAIMTLDAD
HIAVPDLGEL VLGYLRDPKV GFVCTEQRFD VGRHDVLNNA EPMLYKAVQP AKDRDSAASS
CGNGTLYRRT AVESVGGFSE WNIVEDLHTS YQLHAAGWQS VYHHGPVSVG IAPATAAEYA
KQRSRWAMDG LRLLLFDNPL RKPGLTGWQR AHYLHTGIGY LVACAQMMFL LGPPLSVLAG
VQIAAGVSLT AYVLHALPYL VGSLLFIVAY TGPRGAQRTV ASTLFNAPLY ALSFVRVVLS
GRPDSGATAK TALPRMSLLL LPQVLFAASL VVTILVVGVS PDVADLSALV WAGVLLSMVA
GPLSALSERQ DRVERAQLPI RAVILGLVLS FAVVTLLEG
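
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes plain Python with no external libraries; `translate` and `CODON_TABLE` are illustrative names. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; translation table 11 shares
# these assignments with the standard code ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGTACC AGGCAGCGCC CGCCGACGCG"))  # MAYQAAPADA
# Last 12 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("CTGGAGGGCT GA"))  # LEG
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.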