Gene Information |
Locus tag | Teth514_1115 |
Symbol | |
ID | 5876813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1157978 |
End bp | 1159297 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641541469 |
Product | extracellular solute-binding protein |
Protein accession | YP_001662749 |
Protein GI | 167039764 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
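The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 1157978
end_bp = 1159297

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1320 439
```

Under those assumptions the result agrees with the listed Gene Length (1320 bp) and Protein Length (439 aa).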
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000327347 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAA AAATTTTATC CATTATGTTG ACGTTTGCAT TGGTTTTTTC GCTCATGGCA GGCTGTGGTA CAAAAAGCAG CGATAATGGA GAAAGTAATA GTACTGCCAC GGCAACAAAG ACTGTAAAAA TTACTTTGCT AAATTCTAAG GGGGAAATTC AGGCTCAATT AGAAGATGCA GCTAAAGCTT TTACAAAAGA AAATCCGAAT ATTACTGTAG AAGTCATTCC TGCAACAGCC GGTCAGTCAC CTTTTGAAAA GGTTACCTCC ATGTATGCAT CTGGTAATGC ACCGACAATG GCAATGTTAG ATCCAGGTGA TATAGCAAAA TTCAAAGATA AATTCTTAGA TTTAAGCAGT GAAAAGTGGG TTTCAGATGC AATAGATGGT GCTTTAAATG CAGCTACAGT GGATGGAAAG GTTATAGCGT TTCCGTTTGC TGTTGAAGGA TATGGACTTA TTTACAACAA GGCTGTACTG GACAAGGCTT ATGGTGGAAA CTTTGATCCG AGTTCTATAA AGACGAGAGA TGCTTTAGAA GAAGCATTTA AAAAAGTAGA AGCAACAGGT GCTAAAGCAC TAGAAATTTC TCCAATGGAT TGGTCTTTAG GCGCACATTT CCTTTCAATA GCGTATGCGG ATCAATCTAA AGATCCTGCT CAAGTAGCTC AATTTTTATC AGACTTAAAA GCGGGAAAAG TTGATTTAGC AAATAATAAA GTTTTTAACG GTTTAATGGA TACTTTTGAC ATGATGAAAA AGTACAACAT AGATAAAAAT GATCCATTAT CTCCGACTTA TGATAGAGGA CCAGAGCTTA TTGGTAAAGG TGAAGTTGGA TTTTGGTTTA TGGGAAATTG GGCATGGCCA CAGATAAAAG AATTTGATAC TGCAAATGGA CAATACGGCT TTATACCTGT ACCAATCAGC AATAACCCAG ATGACTATGG TAATTCAGGT ATACCTGTAG GTGTAACAAA ATTTATCGGC ATAGATAAAA CACAAAATAG TGCTGAGCAG CAAGATGCAG CTAAGAAATT TTTAGATTGG TTGGTATACA GCTCTACAGG TCAAGACATG CTTGTGAACA AACTTAACAT TATACCTGCA TTTAAAAATA TAACTTTACA ACCGCAAGAT CCCCTTGCTA AATCTATTTT GCAGTATGTT AAGAGTGGTA ATACTTTAGA GTTTATGACT ACATTGCCAC CTGACCACTG GTCAAAGTTA GGAGCTTCAA TGCAAAAGTA TTTGGCAGGG AAAATTGACA GAAAAGGCTT GATTGATGAA ATAGAAAATT ATTGGAAAAA TGTTCAATAA
|
Protein sequence | MKRKILSIML TFALVFSLMA GCGTKSSDNG ESNSTATATK TVKITLLNSK GEIQAQLEDA AKAFTKENPN ITVEVIPATA GQSPFEKVTS MYASGNAPTM AMLDPGDIAK FKDKFLDLSS EKWVSDAIDG ALNAATVDGK VIAFPFAVEG YGLIYNKAVL DKAYGGNFDP SSIKTRDALE EAFKKVEATG AKALEISPMD WSLGAHFLSI AYADQSKDPA QVAQFLSDLK AGKVDLANNK VFNGLMDTFD MMKKYNIDKN DPLSPTYDRG PELIGKGEVG FWFMGNWAWP QIKEFDTANG QYGFIPVPIS NNPDDYGNSG IPVGVTKFIG IDKTQNSAEQ QDAAKKFLDW LVYSSTGQDM LVNKLNIIPA FKNITLQPQD PLAKSILQYV KSGNTLEFMT TLPPDHWSKL GASMQKYLAG KIDRKGLIDE IENYWKNVQ
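The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal illustration, not the method used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is built from the standard NCBI base ordering (TCAG), and the amino acid assignments are those shared by translation tables 1 and 11. Only the first 30 bases of the gene sequence are used here.

```python
from itertools import product

# Codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_30 = "ATGAAGAGAA AAATTTTATC CATTATGTTG"
print(translate(first_30))  # MKRKILSIML
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content of 36%.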