Gene Franean1_0785 details

Gene Information

Locus tag: Franean1_0785
Symbol:
ID: 5669201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 910849
End bp: 912057
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 74%
IMG OID: 641239713
Product: glycosyl transferase family protein
Protein accession: YP_001505149
Protein GI: 158312641
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
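
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship under two assumptions not stated in the record: the start and end positions are 1-based and inclusive, and the CDS ends with a single untranslated stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 910849
END_BP = 912057


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int, stop_codons: int = 1) -> int:
    """Protein length in aa, assuming the stop codon is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1209 402
```

Under these assumptions the result agrees with the listed Gene Length (1209 bp) and Protein Length (402 aa).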


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.546497
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.577316
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGAG CTCGTGCATT CGGGAAGGGC ACCGCGATCG CGGTTCTCGG CGCGACCGCC 
GTGTACGGCC ATGTTCTGTA TCCCGCTTAC ATCGGGTACC GCAGTCGTGG GCTTGCGCCG
TCGGTCCCCG CCGAGCCGGA CGTGTGGCCG GGCCTCAGCG TCGTGGTCTC CGCCTACCGC
GAGTCGGCGG TCATCGGCAC CAAGCTCGAC GAGCTGGCCG GTACGGACTA CCCCGGCCCG
ATGGAGATCA TCGTCGTCGC CGACGACGCG GAGACGGCCA CGGCCGCCCG CCGTCCCGGC
GTCCGGGTCC TGTCGTCCGG GGAGCGGCTC GGCAAGGCCC GCGCGGTCAA CCGCGGTGTC
GCCGCGGCCA GCCACGACGT CGTGGTCCTC ACCGACGCCA ACGCGGTGCT GGCCCCGCAC
TCGCTGCGCG CCGCAGCGCG CCACTTCACC GACGAGTCCG TCGGCGCCGT CGCCGGCGAG
AAGCAGGTCG ACGACCCGGC CGGCGCCCAG GGCTTCTACT GGACGTTCGA GTCCTGGCTC
AAGCAGCGCG AGTCCGCGAC CGGCGCCACC ATCGGCGTGG TCGGCGAGAT GCTGGCGTTC
CGCCGCAAGG CGTTCCGGCC GCTGCCGAAG GACACCGCGG TGGACGACGC CTGGCTGGCG
CTCGACATCC TCGAAAGTGG CCTGCGGGTG GTCTATGAGC CCGAGGCCTA CTCGATCGAG
ACCTCCGCGC CGGACTACGC CGCCGAGTGG GAGCGCCGCA CCCGCATCGT CGCCGGCAAC
CTCGACATGC TCTGGCGGCG CCGCGCCGCG CTCGTGCCCG GCGCGCTGCC GGTCACCTCG
CAGCTGTGGG GGCACCGGCT CGTCCGGTCC TCGTTCGGCC CGCTGGCCCA CGTCGCGCTG
GTGGCGATCA GCGTCCCGGC GGCCCGCAAC AGCTGGGGCG CCCGGCTGTT CCTGCTCGGC
AACGCCGCCG GTGCGGCCAG CGCCGGCGTC CTGATGCGGG GCGGGACCCC GCCCGGGCCG
AGCCGCCTGG TGGCCCAGGT CTTCTTCCTG CAGGCCGTCG CGCTCGGTGG CGTCCGCCGC
TTCCTGGCCC GTGACCGGCC GGCGATCTGG CCGAAGCCGG ACCGGCAGCC CGTGCCGGCG
CAGCCGGCGG CGTCCGAAGC CGACATCGAC CCCGCCAGCG CGTCCGGGGC CTCCGTGTCC
GTTGTGTGA
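
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the listing could be cleaned and its GC fraction computed. The helper names are hypothetical, and only the first row of the listing is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence listing, copied as displayed.
first_row = "ATGACAGGAG CTCGTGCATT CGGGAAGGGC ACCGCGATCG CGGTTCTCGG CGCGACCGCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field of the record.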
 
Protein sequence
MTGARAFGKG TAIAVLGATA VYGHVLYPAY IGYRSRGLAP SVPAEPDVWP GLSVVVSAYR 
ESAVIGTKLD ELAGTDYPGP MEIIVVADDA ETATAARRPG VRVLSSGERL GKARAVNRGV
AAASHDVVVL TDANAVLAPH SLRAAARHFT DESVGAVAGE KQVDDPAGAQ GFYWTFESWL
KQRESATGAT IGVVGEMLAF RRKAFRPLPK DTAVDDAWLA LDILESGLRV VYEPEAYSIE
TSAPDYAAEW ERRTRIVAGN LDMLWRRRAA LVPGALPVTS QLWGHRLVRS SFGPLAHVAL
VAISVPAARN SWGARLFLLG NAAGAASAGV LMRGGTPPGP SRLVAQVFFL QAVALGGVRR
FLARDRPAIW PKPDRQPVPA QPAASEADID PASASGASVS VV
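
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a simplified illustration, not the database's own procedure. It relies on translation table 11 assigning the same amino acids to codons as the standard code, and it does not handle alternative start codons, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last line of the gene sequence listing.
print(translate("ATGACAGGAG CTCGTGCATT CGGGAAGGGC"))  # prints: MTGARAFGKG
print(translate("GTTGTGTGA"))                         # prints: VV
```

The two fragments reproduce the first block and the final two residues of the listed protein sequence.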