Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2123 |
Symbol | |
ID | 5670523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2550384 |
End bp | 2552267 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241044 |
Product | transketolase |
Protein accession | YP_001506465 |
Protein GI | 158313957 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAACTC CCGTGTCCGA AGTGGTCGGT GTCGCCGACC TGGAGACGGT TCGGGAACTC GCACAGCAGC TGCGGGTCGA CTCGGTGCGC AGCAGCACCT CGGCCGGATC CGGTCACCCG ACCTCCTCCA TGTCCGCCGC GGACGTCATG GCGGTGCTGC TGACGCGCCA CCTGCGCTAC GACTGGGACA CCCCGGACGA CCCGGCGAAC GACCACCTGA TCTTCTCCAA GGGGCACGCG TCGCCCCTGA TCTACTCGAT GTTCAAGGCG GTCGGCGTCG TCACCGACGA CGAGCTCATG ACCGGCTACC GCCGGTTCGG GCACCGCCTG CAGGGCCACC CGACCCCGGT CCTGCCCTGG GTCGACGTGG CGACCGGATC ACTCGGCCAG GGCCTCCCCG ACGCCGTCGG GGTCGCGCTG GCCGGCAAGT TCCTGGACAA GGTCCCGTAC CGGGTCTGGG TGGTCTGCGG CGACAGCGAG ATGGCTGAGG GCTCGATGTG GGAGGCGCTG GACAAGGCCG CCTTCTACGG CCTGTCCAAC CTGATCGCGA TCGTGGACGT CAACCGTCTC GGCCAGCGCG GCCCGACCGA GTTCGGCTGG GACATGGACG CCTACGCCCG CCGGGTCGAG GCGTTCGGCG CCCGCGCGGT CGTGATCGAC GGCCACGACC TGGGCGCGAT CGACGCCGCG CTCGCCGACG CCGAGGACGC GACGCGGCCC ACGGTGATCC TGGCCCGCAC CCGCAAGGGC GAAGGCTTCT CAGAGGTCGC CGATGCCGAG GGCTGGCACG GCAAGCCGCT GCCCGAGGAC ATGGCGACCC GAGCGATCAT CGAGCTGGGC GGCGAGCGGA ACCTGCGCGT GCGCGGCCCG GTTCCCCCGG CGGACATGGC CCGCCCGGCC CCGGCCCAGA AGCCGGCGAT CAAGCTGCCG ACCTACGAGC TGGGCGCCAA GGTCGCCACC AGGCGGGCCT ACGGCCAGGC GCTGGCCGCG GTCGGCGCGC ACACCGACGT CGTCGCCCTC GACGCCGAGG TGAGCAACTC GACCTACGCC GAGGACTTCG CCAAGGCGCA CCCTGACCGG TTCTTCGAGA TCTTCATCGC CGAGCAGCAG CTCGTCGCGG CGGCGGTCGG CCTGTCGGTG CGCGGCTACG TTCCGTTCGC CTCCACGTTC GCCGCGTTCT TCTCCCGCGC CTACGACTTC ATCCGGATGG CGGCCATCTC GCAGGTCGAC ATCCGGCTGG CCGGGTCGCA CGCCGGGGTG GAGATCGGTG CGGACGGCCC GTCCCAGATG GCGCTCGAGG ATCTCGCGAT GATGCGGGCC GTGGGCGGCT CGACGGTGCT CTACCCGAGC GACGCGACCT CCGCGGCCAA GCTGATCGAG GCGATGGTCG GGATCTCCGG CGTCAGCTAC ATGCGGACGA CCCGCGGTGC CTACCCGGTG CTCTACGGCC CGGACGAGAC GTTCGCGCCG GGCGGTTCCA AGGTGCTGCG CAGCTCCGAC GCCGACACCG TGACGCTGAT CGGCGCCGGG GTCACCCTGC ACCACTGCCT GGCCGCGGCC GACGTGCTCG CGGCCGAGGG AGTCTCGGCA CGGGTGATCG ACCTGTACTC GGTCAAGCCG GTGGACACCG CGACCCTGGC GGAGGCGGCG CGGGCGACCG GTGGCCGCCT CGTCGTCGCG GAGGACCACT ACGCCGAGGG CGGCCTCGGC GGCGCGGTGC TCGAGGGCCT CGCGGGCGTC CTGCGCGACG GCAGCGTCTC GCTGCGGCTG GCGCATCTTG CCGTCGGTGA GCTTCCCGGT TCCGGTACCC CGGACGAGCT GCTGGACTTC GCCGGCATCT CCACGCCGCA CATCGTCGAG GCGGCGCGCG GCCTGCTGCG CTGA
|
Protein sequence | MATPVSEVVG VADLETVREL AQQLRVDSVR SSTSAGSGHP TSSMSAADVM AVLLTRHLRY DWDTPDDPAN DHLIFSKGHA SPLIYSMFKA VGVVTDDELM TGYRRFGHRL QGHPTPVLPW VDVATGSLGQ GLPDAVGVAL AGKFLDKVPY RVWVVCGDSE MAEGSMWEAL DKAAFYGLSN LIAIVDVNRL GQRGPTEFGW DMDAYARRVE AFGARAVVID GHDLGAIDAA LADAEDATRP TVILARTRKG EGFSEVADAE GWHGKPLPED MATRAIIELG GERNLRVRGP VPPADMARPA PAQKPAIKLP TYELGAKVAT RRAYGQALAA VGAHTDVVAL DAEVSNSTYA EDFAKAHPDR FFEIFIAEQQ LVAAAVGLSV RGYVPFASTF AAFFSRAYDF IRMAAISQVD IRLAGSHAGV EIGADGPSQM ALEDLAMMRA VGGSTVLYPS DATSAAKLIE AMVGISGVSY MRTTRGAYPV LYGPDETFAP GGSKVLRSSD ADTVTLIGAG VTLHHCLAAA DVLAAEGVSA RVIDLYSVKP VDTATLAEAA RATGGRLVVA EDHYAEGGLG GAVLEGLAGV LRDGSVSLRL AHLAVGELPG SGTPDELLDF AGISTPHIVE AARGLLR
|
| |