Gene Information |
Locus tag | Athe_0149 |
Symbol | |
ID | 7408511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 185685 |
End bp | 187049 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643714551 |
Product | PTS system, fructose subfamily, IIC subunit |
Protein accession | YP_002572074 |
Protein GI | 222528192 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component; [COG1445] Phosphotransferase system, fructose-specific IIB component |
TIGRFAM ID | [TIGR00829] PTS system, fructose-specific, IIB component; [TIGR01427] PTS system, fructose subfamily, IIC component |
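
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, not part of the record. It assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length, and that the final codon is a stop codon not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the Gene Information table.
start_bp = 185685
end_bp = 187049
gene_length_bp = 1365
protein_length_aa = 454

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not part of the protein.
coding_codons = codons - 1

print(span, remainder, coding_codons)  # 1365 0 454
```

Under these assumptions the three fields agree: 1365 bp is 455 codons, and dropping the stop codon leaves 454 residues.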
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00017451 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGTGGCTGT CACATCTTGC CCCACTGGAA TTGCCCACAC GTACATGGCG GCAGAGGCAC TTCAGATGGC GGCAAAAGAG CTTGGGGTTG AAATCAAGGT CGAAACAAGA GGATCTGTCG GTGCAGAAAA TGAGATAACT CCAGAGGATT TGAAGCAAGC ACATGCGGTA ATTTTAGCTT GTGACACCAA GATTGATGAA GATAGGTTCC AAGGATTGCC AATTGTACGA GCAAGTGTGA AGGATGCTAT CAAAGATCCC AAAGGGCTTA TTACAAAGGC TATGAACATG GAGAAGAAGG ACTATGTTGA TAAGGTCTTT GAGGCCAAAA AAGAAGCAAA AGAAAAAGCA ACTGGTGTTT ACAAGCATTT GATGACAGGT GTTTCTTATA TGATTCCATT TGTTGTCGCA GGTGGTATTT TAATTGCAAT ATCTTTTGCA TTTGGAATTA AAGCTTTTGA GAAAAAAGGA ACACTTGCAG CAGCGCTCAT GGACATAGGC GGTGGCAGTG CATTTTATCT CATGGTTCCA ATTCTGGCTG GCTTTATTGC ATTTTCGATT GCAGACAGGC CTGGGCTTGT ACCAGGAATG ATAGGTGGGC TTTTGGCAAA TAAGCTTGGA GCAGGTTTTT TGGGCGGCAT TGTTGCAGGG TTTGCAGCCG GGTATTTAGT TGCATGGCTT AAGAAAACCA TAAAGTTACC AAAGACAATG GAAGGTTTGA TACCAGTATT GATACTTCCT GTTTTGTCAA CATTGATTAT TGGTTTGGGT ATGATATATG TTGTAGGTGA ACCTGTTGCA GCTTTAAATA AGGCAATGAC TGAATGGCTC AAGAGCATGA GCAGTGGCAG CGCTGTGTTA CTTGGAATAA TTTTGGGTTT GATGATGGCG TTTGATATGG GTGGACCTGT CAACAAAGCT GCATATACAT TTGCTGTATC AACTTTGGCG GCAGGTCAGC CATCAACAAT AATGGCAGCT GTCATGGCAG CTGGCATGAC ACCACCACTT GGACTTGCGC TTGCCACCTT GATAGCAAAG GATAAATTTA CAACAGAAGA AAGAGAAGCA GGAAAGGCGG CATTCTTCCT TGGGATTTCA TTTATAACTG AAGGGGCTAT TCCATTTGCT GCTGCAGATC CACTGAGGGT TATTCCATCT ATTATGATTG GATCAGCAGT AACTTCGGCC CTGAGTATTT TGTTCAAATG TACTTTGGCA GTTCCACATG GCGGGATATT TGTACTGCCG ATACCAAACG CTGTAGGAAA TTTACTCTTG TATGCGGTGG CTATTGCAAT AGGAACAGTT GTAACAGCAC TTATAGTTTC GGTTTTAAAG CCAAAGAAGG TTTGA
|
Protein sequence | MKKIVAVTSC PTGIAHTYMA AEALQMAAKE LGVEIKVETR GSVGAENEIT PEDLKQAHAV ILACDTKIDE DRFQGLPIVR ASVKDAIKDP KGLITKAMNM EKKDYVDKVF EAKKEAKEKA TGVYKHLMTG VSYMIPFVVA GGILIAISFA FGIKAFEKKG TLAAALMDIG GGSAFYLMVP ILAGFIAFSI ADRPGLVPGM IGGLLANKLG AGFLGGIVAG FAAGYLVAWL KKTIKLPKTM EGLIPVLILP VLSTLIIGLG MIYVVGEPVA ALNKAMTEWL KSMSSGSAVL LGIILGLMMA FDMGGPVNKA AYTFAVSTLA AGQPSTIMAA VMAAGMTPPL GLALATLIAK DKFTTEEREA GKAAFFLGIS FITEGAIPFA AADPLRVIPS IMIGSAVTSA LSILFKCTLA VPHGGIFVLP IPNAVGNLLL YAVAIAIGTV VTALIVSVLK PKKV
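
The relationship between the two sequences can be illustrated with a small translation routine. This is a minimal sketch and not the tool used to produce the record. It builds the codon table from the standard NCBI amino-acid string, whose codon assignments translation table 11 shares. Table 11 also permits alternative start codons, which the sketch does not handle; that is not an issue here because this gene starts with ATG. The function names `clean` and `translate` are hypothetical. Only the first three and last two 10-base blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
gene_prefix = "ATGAAAAAAA TTGTGGCTGT CACATCTTGC"
gene_suffix = "CCAAAGAAGG TTTGA"

print(translate(gene_prefix))  # MKKIVAVTSC
print(translate(gene_suffix))  # PKKV
```

The two outputs match the first block and the last four residues of the protein sequence listed above; the final TGA codon is the stop and produces no residue.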