Gene Information |
Locus tag | Franean1_5901 |
Symbol | |
ID | 5674222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7168025 |
End bp | 7169095 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641244749 |
Product | glucose-1-phosphate thymidyltransferase |
Protein accession | YP_001510151 |
Protein GI | 158317643 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01207] glucose-1-phosphate thymidylyltransferase, short form; [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form |
| |
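The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon (neither convention is stated in the record itself). The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 7168025
END_BP = 7169095
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones reported in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both derived values agree with the reported Gene Length and Protein Length.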
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCAC TCGTGCTCGC CGGTGGCTCC GGGACCCGTC TTCGGCCGAT CACCCACACC TCGGCCAAGC AACTCGTGCC CGTCGCCAAC AAGCCGGTGC TGTTCTACGG GCTCGAGGCG ATCCGCGGCG CCGGGATCAC CGATGTCGGG ATCATCGTCG GCGAGACGGC GGCCGAGATC GAGAACGCGG TTGGTGACGG ATCGCAGTTC GGCATCACCG TCACCTACAT CCGCCAGGAG GCGCCGCTCG GTCTGGCACA CGCCGTCCTC ATCGCGCGGG ACTTCCTCGC CGACGAGCCG TTCGTGATGT ACCTCGGCGA CAACATGATC ATCGGTGGGA TCTCCGGGCT GGTCGAGGAG TTCCGGCACA CCACCTCCGA TGCGCTCATC CTGCTCACCA AGGTCGACAA CCCCTCGGCG TTCGGCGTCG CCGAGCTGGG AGCGGACGGC CGGATCATCC GGCTCGTCGA GAAGCCCGCC GACCCGCCGA GCGACCTCGC GCTCGTCGGC GTCTACATGT TCGGTCTCGC GATCCACGAG GCCGTCCGCT CGATCAAGCC CTCAGGACGC GGCGAGCTCG AGATCACCGA AGCGATCCAG TGGCTCGTGG ACGGCGGCTA CGACGTCGCG CCGCACCTTG TCGAGGGCTA CTGGAAGGAC ACCGGCCGGC TCGACGACAT GCTGGAGACC AACCGGCACA TCCTCGAGTC AATCGAGCCC GCGATCCACG GCACCGTGGA CGAGCACAGC ACCATCGTCG GCCGGGTGGT GATCGAGGAG GGTGCGTCCC TGGTGCGCTC GACGGTGCGC GGCCCGGCGA TCATCGGCCG CGGCACCCGG CTCGTCGACA CCTACGTAGG CCCCTTCACC TCGATCTACC ACTCGTGCGT CGTCGAACGA ACCGAGATCG AGTACTCGAT CGTGCTCGAG CGGGCCACCA TCCGCGGCAT CGGCCGCATC GAGGACTCCC TGATCGGGCG GGACGCCGAG GTCGTACCGT CGAGCGCTCT CCCCAAGGCG CACCGCCTGA TGATCGGCGA TCACTCCCGG GTCTCGGTCG CAACGAGCTA G
|
Protein sequence | MKALVLAGGS GTRLRPITHT SAKQLVPVAN KPVLFYGLEA IRGAGITDVG IIVGETAAEI ENAVGDGSQF GITVTYIRQE APLGLAHAVL IARDFLADEP FVMYLGDNMI IGGISGLVEE FRHTTSDALI LLTKVDNPSA FGVAELGADG RIIRLVEKPA DPPSDLALVG VYMFGLAIHE AVRSIKPSGR GELEITEAIQ WLVDGGYDVA PHLVEGYWKD TGRLDDMLET NRHILESIEP AIHGTVDEHS TIVGRVVIEE GASLVRSTVR GPAIIGRGTR LVDTYVGPFT SIYHSCVVER TEIEYSIVLE RATIRGIGRI EDSLIGRDAE VVPSSALPKA HRLMIGDHSR VSVATS
|
| |
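The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, and the stop codons listed are those of translation table 11 (TAA, TAG, TGA). The inputs shown are short fragments copied from the start and end of the gene sequence; the full sequence could be passed in the same way, for example to compare its GC fraction with the GC content reported in the record.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases of a cleaned sequence form a stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Fragments only: the first three and the last three blocks of the gene.
    head = clean_sequence("ATGAAGGCAC TCGTGCTCGC CGGTGGCTCC")
    tail = clean_sequence("GTCTCGGTCG CAACGAGCTA G")
    print(head.startswith("ATG"))
    print(ends_with_stop(tail))
    print(round(gc_content(head), 2))
```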