Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1512 |
Symbol | |
ID | 7976599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1584242 |
End bp | 1586113 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798409 |
Product | PTS system, fructose subfamily, IIC subunit |
Protein accession | YP_002949582 |
Protein GI | 239826958 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component |
TIGRFAM ID | [TIGR00829] PTS system, fructose-specific, IIB component [TIGR00848] PTS system, fructose subfamily, IIA component [TIGR01427] PTS system, fructose subfamily, IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000350477 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTA CCGATTTGCT TACAAAGGAA ACAATTATTC TCCATCTGAA AGCGAAAACA AAAGAAGAAG TAATTGACGA ACTTGTCGCG AAACTGCAAG AAGCAGGAGT GTTGCGTGAC GCACAAGCGT TCAAAGAAGC GATTTTTGCG CGTGAAGCAC AAAGCACGAC CGGTGTCGGT GATGGAATTG CCATTCCTCA TGCGAAAACA GCCGCGGTAA AGCGGCCTGC CGTAGCGTTT GGCCGTTCTG AGAGCGGCAT TGACTATGAC GCGCTTGATG GAAAACCGAG CCGCTTGTTT TTTATGATCG CAGCGCCAGA AGGAGCGAAT AATACACATT TAGAAGCGCT TGCCCGCTTA TCGTCTATGC TAATGGATTC CTCTTTCCGC GCGCGGATTG AAAGCGTTTC AAATGAAGAG GAATTTATTC GGTTGATTGC AGAAAAAGAG GCAGAGGAAA CGAAAGAAGC AGAACATACG GCATCTTCAC CTTCGAAGCG CAAAAAAGTC ATTGCTGTCA CTGCTTGTCC GACAGGAATC GCTCATACGT ATATGGCGGC AGATGCTTTA AAAGCAAAAG CAGCGGAAAT GGATGTGGAT ATAAAAGTAG AAACAAACGG TTCTAGCGGA GTGAAAAACG AATTAACCAA GCAAGATATT GAGGAAGCGG TTGCGGTCAT CGTTGCGGCG GATAAGCAAG TGGAAATGGA GCGGTTCAAA GGAAAACATG TGATTCAAGT ACCGGTTGCG CAAGCGATTC GTAAACCGAA AGAGCTAATC GAACAAGCTC TCCGGCAAGA TGCGCCGATT TATCAAGGAA GCGGCGCGAA AGAGGCGACA ACGGTAGGAA AGCCGCGTAC AGGATTTTAT AAACATTTAA TGAACGGAGT TTCCAACATG CTTCCGTTCG TTGTCGGCGG CGGTATTTTA ATTGCGATTT CATTTATCTT CGGTATTAAA GCGTTTGATC CAAAAGATCC GTCGTATCAT CCGATAGCGA AAGCGCTTAT GGATATCGGT GGAGGAAACG CCTTTGCGTT GATGATTCCA GTGCTTGCTG GTTTTATTGC CATGAGTATT GCGGACCGGC CTGGTTTTGC GCCAGGGATG GTAGGCGGTT TCATGGCGGC AAATGGCGGC GCCGGCTTTT TAGGCGGGTT AATCGCAGGC TTTCTTGCCG GGTATTTAGT GGTTGGGTTG AAAAAAGTAT TTAGCCATCT GCCACAGTCG CTTGAGGGAA TCAAACCAGT GTTGCTTTAT CCGCTTTTTG GCATTTTCAT TACGGGACTT ATTATGATGT ATGTTGTTAT CGATCCAGTG AAGGCGTTAA ATGAGGCAAT GAAGCATTGG CTTGAAAATA TGGGAACGGC AAACTTAATT TTACTTGGCG CGATTCTTGG CGGCATGATG GCGGTTGATA TGGGCGGCCC GATCAACAAA GCGGCATTTA CGTTTGGAAT TGCCATGATT GATGCTGGAA ATTATGCTCC GCACGCAGCG ATCATGGCCG GAGGAATGGT GCCGCCGTTA GGGCTGGCTC TTGCGACGAC ATTCTTTAAA AAGAAATTTA CAAAAGCGGA ACGTGAAGCT GGAAAAACAT GCTATATCAT GGGAGCGACG TTCATTACAG AAGGGGCGAT TCCGTTTGCG GCAGCCGACC CGGTACGCGT CATTCCATCC ATCATTGTCG GCTCTGCGGT CAGCGGAGCG CTGACGATGT TGTTTCACAT TGGCCTTCCA GCTCCGCATG GTGGAATTTT CGTTATTCCA ATTGTAAAAG GAAGTGCTTT ATTATACGTA TTAGCGATTT TGATCGGTTC GATCATTACT GCTTTGATGG TTGGCTTGTG GAAAAAAGAA GTAGAGGAAT AA
|
Protein sequence | MKITDLLTKE TIILHLKAKT KEEVIDELVA KLQEAGVLRD AQAFKEAIFA REAQSTTGVG DGIAIPHAKT AAVKRPAVAF GRSESGIDYD ALDGKPSRLF FMIAAPEGAN NTHLEALARL SSMLMDSSFR ARIESVSNEE EFIRLIAEKE AEETKEAEHT ASSPSKRKKV IAVTACPTGI AHTYMAADAL KAKAAEMDVD IKVETNGSSG VKNELTKQDI EEAVAVIVAA DKQVEMERFK GKHVIQVPVA QAIRKPKELI EQALRQDAPI YQGSGAKEAT TVGKPRTGFY KHLMNGVSNM LPFVVGGGIL IAISFIFGIK AFDPKDPSYH PIAKALMDIG GGNAFALMIP VLAGFIAMSI ADRPGFAPGM VGGFMAANGG AGFLGGLIAG FLAGYLVVGL KKVFSHLPQS LEGIKPVLLY PLFGIFITGL IMMYVVIDPV KALNEAMKHW LENMGTANLI LLGAILGGMM AVDMGGPINK AAFTFGIAMI DAGNYAPHAA IMAGGMVPPL GLALATTFFK KKFTKAEREA GKTCYIMGAT FITEGAIPFA AADPVRVIPS IIVGSAVSGA LTMLFHIGLP APHGGIFVIP IVKGSALLYV LAILIGSIIT ALMVGLWKKE VEE
|
| |