Gene Information |
Locus tag | Franean1_0986 |
Symbol | |
ID | 5669400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1154192 |
End bp | 1155319 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239914 |
Product | thiamine monophosphate kinase |
Protein accession | YP_001505348 |
Protein GI | 158312840 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
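The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1154192
END_BP = 1155319


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (1128 bp) and Protein Length (375 aa) fields. The strand field does not affect the length calculation, only which strand the coding sequence is read from.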
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.243699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAC GCGGCCAGCC GGACACCGAA AGCACCGACG ACACCACCGA CCCCGGCAGC ACCGGCAGCA CCGGCAGCGG GCCGACGGTG GCGGAGACGG GTGAGTTCGG GCTGATCCGG GCGATCACCC GACGCCTCCC CGTCGGGGCG GACGTCCTGC TCGGGCCCGG TGACGACGCG GCGGTGGTGG CCGCTCCCGA CGGACGGGTG GTCGTCACGA CCGACCTGCT CGTCGAGGGC CGGCATTTCC GGCGCGACTG GTCCTCCGCG TACGACACAG GGCGCAAGGC GGCGGCGCAG AACCTCGCCG ACGTCGCCGC CATGGGCGCG CGCCCGACCG CCCTGGTGGT GGGTTTCGCC GCGCCCGGCG AGCTGCCCGT CGCCTGGGCC GAGCAGCTCG CCGACGGCCT GCGTGACGAG TGCGCGCTGG TCGGCGCGTC CGTGGCCGGC GGCGACGTCA GCTCGGCCGC CGGGATCGTG CTGGGGATCA CCGCGCTCGG CGACCTCGCC GGACGAGCCC CGATCCGCCG GGACGGCGCC CGCCCCGGCG ACCGGGTGGT GCTGGCGGGA CGGATCGGAT GGGCCGAGGC CGGCCTCGCC CTCCTGCGCG CCACCGAGCT CCCCCCCGAG ATCCTGCGGG CGCACGCCGA GGTTGTGGAC GCGCACCGGC GGCCCCACCC GCCCTACACG CTCGGCCCGC TGCTCGCGGC GGCGGGGGCG CACGCGATGT GCGACGTGTC GGACGGTCTC CTCGCCGATC TCGGCCATGT CGCGGCGGCC TCCGCCGTGT GGATCGACAT CGACCCCACG GCGCTGCCGG TGCCCGGACC GATCCGGGAC GCCGCCGCGG TGCTGGGCGC CGATCCGCTG ACCTGGGTTC TCACTGGAGG TGACGACCAC GCGCTGGTCG CCTGCCTTGC TCCTACCGCC CTGCTTCCGG CCGGCTGCGT CGTCATCGGA CGGGTGCTGG CCGTCCCCGC CGATGCGGTC TCGGGTTCGG CCGCCCCCAC CCATGTGGGC CCGGACGGGA CCCGGCCTGA TGACCGCGGC GCGGGTCACG GGGTGCTCGT CGGCGGGCTC GACTACCGGC AGGGCACCGG ATGGGACCAC TTCCGCCAGA CCACCTGA
|
Protein sequence | MSTRGQPDTE STDDTTDPGS TGSTGSGPTV AETGEFGLIR AITRRLPVGA DVLLGPGDDA AVVAAPDGRV VVTTDLLVEG RHFRRDWSSA YDTGRKAAAQ NLADVAAMGA RPTALVVGFA APGELPVAWA EQLADGLRDE CALVGASVAG GDVSSAAGIV LGITALGDLA GRAPIRRDGA RPGDRVVLAG RIGWAEAGLA LLRATELPPE ILRAHAEVVD AHRRPHPPYT LGPLLAAAGA HAMCDVSDGL LADLGHVAAA SAVWIDIDPT ALPVPGPIRD AAAVLGADPL TWVLTGGDDH ALVACLAPTA LLPAGCVVIG RVLAVPADAV SGSAAPTHVG PDGTRPDDRG AGHGVLVGGL DYRQGTGWDH FRQTT
|
| |
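The sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such a field might be cleaned, its GC fraction computed, and its opening codons translated. It assumes the gene sequence is already given in coding orientation, and it uses the standard codon assignments, which translation table 11 shares for sense codons. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical, and `SAMPLE` holds only the first three blocks of the gene sequence from this record.

```python
from itertools import product

# First 30 nt of the gene sequence above, in the blocked layout used here.
SAMPLE = "ATGAGCACAC GCGGCCAGCC GGACACCGAA"

# Standard codon assignments in TCAG order ('*' marks a stop codon).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    dna = clean_sequence(SAMPLE)
    print(len(dna), round(gc_content(dna), 3), translate(dna))
```

For the 30-nucleotide sample, the translation matches the first ten residues of the protein sequence (MSTRGQPDTE). Applying the same helpers to the full gene sequence field is one way to cross-check it against the GC content and Protein Length fields.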