Gene Information |
Locus tag | Moth_0013 |
Symbol | |
ID | 3831885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 13155 |
End bp | 14543 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637827940 |
Product | PTS fructose IIC component |
Protein accession | YP_428896 |
Protein GI | 83588887 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component; [COG1445] Phosphotransferase system fructose-specific component IIB |
TIGRFAM ID | [TIGR00829] PTS system, fructose-specific, IIB component; [TIGR01427] PTS system, fructose subfamily, IIC component |
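The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source database: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both assumptions are consistent with the values in this record.

```python
# Hypothetical helpers; the numeric values are copied from the record above.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return gene_len_bp // 3 - 1


length = gene_length(13155, 14543)
print(length)                  # 1389, matches "Gene Length"
print(protein_length(length))  # 462, matches "Protein Length"
```

A check like this only applies to genes that are not spliced and not pseudogenes, as this record states for Moth_0013.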
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.991255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000838432 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAAAAA TAGTTGCTGT TACTTCCTGC CCGACAGGAA TAGCCCACAC CTATATGGCA GCGGAAGCTT TGCAAAAGGC AGCTAAAGAA CAGGGAGTGG CGATAAAAGT AGAAACCCGT GGGTCTATTG GGGTTGAAAA TGAACTGACG CCAGCAGATA TCGAAGAGGC CGTGGCAGTT ATTATAGCTG CAGATGCTAA AGTAGATGAG GAAAAATTTC AAGGGAAGCC CATAGTTAGG TCTTCAACGG GAGAAGCTAT TAAAAATGCT AAAACTTTGA TAGACAAAAC TTTAAGCTTG GAGGGTAAAG CGTCCTCAAG GCAAATTACT GATTTTATTC GTGAGGTCGA GCAGCGCAAA AGTGAGCGGA GGTCCCAGGC TACTGGTTTT TATAAACACT TAATGACAGG CGTATCGTAT ATGATACCCT TTGTGGTAGC AGGTGGAATA ATCATTGCTC TTTCCTTCAT ATTTGGTATC GAGGCCTTTA AGCAAGAGGG TACTTTAGCC GCCAACCTTA TGCGTATTGG TGGCGGTTCA GCTTTTGCCT TAATGGTGCC AATACTGGCA GGGTATATTG CATTTTCCAT TGCTGATCGG CCGGGCCTGG CTCCAGGAAT TATAGGAGGT ATGCTGGCTA CTCAAATGGG GGCAGGTTTT CTGGGAGGAA TTGTTGCTGG ATTTCTGGCT GGTTATGTGG CCAAATTTTT ACGAGATAAT ATTAAGCTCC CTGCGGGTCT GGAAGGATTA AAACCGGTAT TAATAATTCC ACTAATTTCT ACTTTAATAG TGGGTTTGTT GTTAATCTAC GTTATAGGAA CTCCCGTAAA AGTTATAATG GATGGTCTTG AACATTGGTT GACGTCAATG AGCAGAGGCA ATGCTGTTAT TTTAGGCTTT ATTCTAGGTG CTATGATGGC CTTAGATATG GGAGGACCGG TAAACAAAGC GGCTTATACC TTTGCGGTTG GGTTGCTTGG TAGCAATATT TATGAGCCCC AGGCTGCGGT TATGGCTGCC GGCATGACGC CTCCTTTAGG TTTAGCTCTA GCTACTTTGC TGTTCCCTAA AAAGTTTACT AGCGAAGAAA GAGAGGCTGG TAAGGCAGCA GCAGTTTTAG GGATTTCTTT TATTACCGAA GGCGCTATTC CCTTTGCTGC ATCTGATCCT TTCAGGGTGA TTCCATCTAT TGTGGCAGGT TCAGCCGTAG CTGGTGCTCT CTCCATGGCT TTTAATGCAA CCCTGAGGGC GCCACATGGA GGCATTTTCG TCCTTGCCAT CCCTAATGCA GTAGGGCACT TAGGATTATA TAGCCTGTCT ATTGCCATTG GTACCCTCGT CACGGCTTTA ATGGTGTCAC TTTTGAAACC CAACAAAGAA ATAAAATAA
|
Protein sequence | MKKIVAVTSC PTGIAHTYMA AEALQKAAKE QGVAIKVETR GSIGVENELT PADIEEAVAV IIAADAKVDE EKFQGKPIVR SSTGEAIKNA KTLIDKTLSL EGKASSRQIT DFIREVEQRK SERRSQATGF YKHLMTGVSY MIPFVVAGGI IIALSFIFGI EAFKQEGTLA ANLMRIGGGS AFALMVPILA GYIAFSIADR PGLAPGIIGG MLATQMGAGF LGGIVAGFLA GYVAKFLRDN IKLPAGLEGL KPVLIIPLIS TLIVGLLLIY VIGTPVKVIM DGLEHWLTSM SRGNAVILGF ILGAMMALDM GGPVNKAAYT FAVGLLGSNI YEPQAAVMAA GMTPPLGLAL ATLLFPKKFT SEEREAGKAA AVLGISFITE GAIPFAASDP FRVIPSIVAG SAVAGALSMA FNATLRAPHG GIFVLAIPNA VGHLGLYSLS IAIGTLVTAL MVSLLKPNKE IK
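The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below uses hypothetical helper names and shows one way to normalize such a string, compute its GC percentage, and check that it ends on a stop codon. The stop codons TAA, TAG and TGA are the ones used by translation table 11. The example runs on the first two blocks of the gene sequence only, not the full gene.

```python
# Hypothetical helpers for sequences shown in space-separated blocks.

STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq: str) -> str:
    """Remove display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the normalized sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = normalize(seq)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


first_blocks = "ATGAAAAAAA TAGTTGCTGT"  # first two blocks of the gene sequence
print(normalize(first_blocks))   # ATGAAAAAAATAGTTGCTGT
print(gc_percent(first_blocks))  # 25.0
```

Run on the complete gene sequence, gc_percent should be comparable with the GC content field, though how the database rounds that value is not stated in the record.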