Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_0412 |
Symbol | |
ID | 5877216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 423268 |
End bp | 424668 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641540746 |
Product | PTS system, N-acetylglucosamine-specific IIBC subunit |
Protein accession | YP_001662058 |
Protein GI | 167039073 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000209524 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTGGT TAGCAAGTGT TCAAAAACTT GGTAAAGCCT TAATGCTTCC AGTTGCAGTT TTACCTGCAG CAGCATTGCT TTTAAGGCTT GGAGCACCTG ATGTTTTCAA CATTCCTTTT ATCATGCAGG CAGGTGCAGC GGTATTTGAC AATTTACCCC TCATCTTTGC TATAGGTATT GCAATAGGTT TTGCAGAAGG GGACGGAGTT GCGGCTCTTG CTGCAGCGGT AGGCTATTTT GTACTGACAA AAGGTGCTAC CACAATAAAC AAAGACATCA ATATGGGTGT TTTAGGCGGT ATATTGATGG GTATCATCGC AGGTTATTTG TATAACAAAT ACCATGATAT TAAACTTCCT GATTTCTTAG GTTTCTTTGG AGGAAAGAGA TTTGTTCCAA TTGTAACGGC ATTTGCTGCA ATTGTACTGG CACTTGTGAT GGGGTATGTA TGGCCTCCAA TACAGAATGG CATATATGCG CTTGGTGAGT GGATAATTGG CGCAGGAGCA CTTGGAGTAT TTGTATACGG TGTGCTAAAT AGGCTTTTGA TTCCTTTTGG ACTTCATCAC GTTATAAACA GTCTTGTCTG GTTTGTTTTT GGTACATTTA AGACAGCAGC TGGCGAAGTT TTAACTGGCG ATTTAAACAG GTTCTTTGCA GGAGACCCGA CTGCAGGTAT CTTTATGGCC GGATTTTATC CGATTATGAT GTTTGGACTT CCAGCAGCAG CACTGGCTAT GTGGGCAGCA GCTAGGCCGA ATCAAAGAAA AGTTGTGTCT GGTGTGTTAA TAAGTGCAGC TCTTACAGCC CTTTTAACGG GTATTACTGA ACCCATCGAA TTTGCATTTA TGTTCTTGGC ACCAGTACTT TATGTCATAC ACGCCCTTTT GACAGGTTTG TCTCTCACTA TAACTTATAT TCTTGGAATA AAAGCTGGTT TTGGTTTTTC GGCTGGTCTT ATTGACTATG TATTAAGCTA TGGCATATCT ACCAAACCGC TCTTACTTCT TTTGATAGGT ATCATATACG GAGCAATATA CTATGTAATT TTCTATTACG TAATAGTAAA ATTCAATTTG CCAACTCCTG GCAGATTAGA AGAAGAAGCA ACAGATCAGT ATAAAGAATT GTCAAAATCA GAAATTGGAG GTATTGCTGC ACAATACGTA GAGGTATTGG GTGGAGCAGA AAATATACAG TCTTTGGAGG CTTGCATAAC AAGATTGCGC TTGACTGTAA AAGACGATAC AATAATAGAC GATGATAAAC TTAAAAAGTT AGGGGCGACA GGCGTAATGA GAATGGGCAA AAATGCATTG CAAGTAATTG TTGGTACAAA AGCTGATTTG ATTGCACAAG AAATGAAAAA ACACATGAAA AAAGCAGGAG GTAAAATTTA A
|
Protein sequence | MKWLASVQKL GKALMLPVAV LPAAALLLRL GAPDVFNIPF IMQAGAAVFD NLPLIFAIGI AIGFAEGDGV AALAAAVGYF VLTKGATTIN KDINMGVLGG ILMGIIAGYL YNKYHDIKLP DFLGFFGGKR FVPIVTAFAA IVLALVMGYV WPPIQNGIYA LGEWIIGAGA LGVFVYGVLN RLLIPFGLHH VINSLVWFVF GTFKTAAGEV LTGDLNRFFA GDPTAGIFMA GFYPIMMFGL PAAALAMWAA ARPNQRKVVS GVLISAALTA LLTGITEPIE FAFMFLAPVL YVIHALLTGL SLTITYILGI KAGFGFSAGL IDYVLSYGIS TKPLLLLLIG IIYGAIYYVI FYYVIVKFNL PTPGRLEEEA TDQYKELSKS EIGGIAAQYV EVLGGAENIQ SLEACITRLR LTVKDDTIID DDKLKKLGAT GVMRMGKNAL QVIVGTKADL IAQEMKKHMK KAGGKI
|
| |