Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0785 |
Symbol | |
ID | 5669201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 910849 |
End bp | 912057 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239713 |
Product | glycosyl transferase family protein |
Protein accession | YP_001505149 |
Protein GI | 158312641 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.546497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.577316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGGAG CTCGTGCATT CGGGAAGGGC ACCGCGATCG CGGTTCTCGG CGCGACCGCC GTGTACGGCC ATGTTCTGTA TCCCGCTTAC ATCGGGTACC GCAGTCGTGG GCTTGCGCCG TCGGTCCCCG CCGAGCCGGA CGTGTGGCCG GGCCTCAGCG TCGTGGTCTC CGCCTACCGC GAGTCGGCGG TCATCGGCAC CAAGCTCGAC GAGCTGGCCG GTACGGACTA CCCCGGCCCG ATGGAGATCA TCGTCGTCGC CGACGACGCG GAGACGGCCA CGGCCGCCCG CCGTCCCGGC GTCCGGGTCC TGTCGTCCGG GGAGCGGCTC GGCAAGGCCC GCGCGGTCAA CCGCGGTGTC GCCGCGGCCA GCCACGACGT CGTGGTCCTC ACCGACGCCA ACGCGGTGCT GGCCCCGCAC TCGCTGCGCG CCGCAGCGCG CCACTTCACC GACGAGTCCG TCGGCGCCGT CGCCGGCGAG AAGCAGGTCG ACGACCCGGC CGGCGCCCAG GGCTTCTACT GGACGTTCGA GTCCTGGCTC AAGCAGCGCG AGTCCGCGAC CGGCGCCACC ATCGGCGTGG TCGGCGAGAT GCTGGCGTTC CGCCGCAAGG CGTTCCGGCC GCTGCCGAAG GACACCGCGG TGGACGACGC CTGGCTGGCG CTCGACATCC TCGAAAGTGG CCTGCGGGTG GTCTATGAGC CCGAGGCCTA CTCGATCGAG ACCTCCGCGC CGGACTACGC CGCCGAGTGG GAGCGCCGCA CCCGCATCGT CGCCGGCAAC CTCGACATGC TCTGGCGGCG CCGCGCCGCG CTCGTGCCCG GCGCGCTGCC GGTCACCTCG CAGCTGTGGG GGCACCGGCT CGTCCGGTCC TCGTTCGGCC CGCTGGCCCA CGTCGCGCTG GTGGCGATCA GCGTCCCGGC GGCCCGCAAC AGCTGGGGCG CCCGGCTGTT CCTGCTCGGC AACGCCGCCG GTGCGGCCAG CGCCGGCGTC CTGATGCGGG GCGGGACCCC GCCCGGGCCG AGCCGCCTGG TGGCCCAGGT CTTCTTCCTG CAGGCCGTCG CGCTCGGTGG CGTCCGCCGC TTCCTGGCCC GTGACCGGCC GGCGATCTGG CCGAAGCCGG ACCGGCAGCC CGTGCCGGCG CAGCCGGCGG CGTCCGAAGC CGACATCGAC CCCGCCAGCG CGTCCGGGGC CTCCGTGTCC GTTGTGTGA
|
Protein sequence | MTGARAFGKG TAIAVLGATA VYGHVLYPAY IGYRSRGLAP SVPAEPDVWP GLSVVVSAYR ESAVIGTKLD ELAGTDYPGP MEIIVVADDA ETATAARRPG VRVLSSGERL GKARAVNRGV AAASHDVVVL TDANAVLAPH SLRAAARHFT DESVGAVAGE KQVDDPAGAQ GFYWTFESWL KQRESATGAT IGVVGEMLAF RRKAFRPLPK DTAVDDAWLA LDILESGLRV VYEPEAYSIE TSAPDYAAEW ERRTRIVAGN LDMLWRRRAA LVPGALPVTS QLWGHRLVRS SFGPLAHVAL VAISVPAARN SWGARLFLLG NAAGAASAGV LMRGGTPPGP SRLVAQVFFL QAVALGGVRR FLARDRPAIW PKPDRQPVPA QPAASEADID PASASGASVS VV
|
| |