Gene Information |
Locus tag | Teth514_1181 |
Symbol | |
ID | 5877967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1217218 |
End bp | 1218459 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641541531 |
Product | extracellular solute-binding protein |
Protein accession | YP_001662811 |
Protein GI | 167039826 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| |
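The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the stated Gene Length, and that the CDS ends in a single untranslated stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above. Coordinates are ASSUMED to be
# 1-based and inclusive, which matches the stated Gene Length.
START_BP = 1217218
END_BP = 1218459
STOP_CODON_NT = 3  # the terminal stop codon is not translated


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_NT) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1242
print(protein_length(length_bp))  # 413
```

Both results agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields in the record.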
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000595288 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT ACACAAAAAT AATTGCATTG CTGACTGTGA TGGTATTCGT TTTGTCAGCT GCTTTAACAG GTTGTGGTGG CAGCAAGACA AGCGAAAATA CTTCATCAGG TCAGACAGAA CAAAAGAAAG AGCCTGTAGA ATTAATAGTA TGGTCCCACT TAACAGATCC TGAAATAGCA AAGGTTCAAG AGATTGCAAA TAAGTGGGCA GAGCAAACTG GCAACAAAGT AAAAGTTTTG GCAGACCAGA GTGACTTCCA AGCTTTCTCA ACAGCTGCTC AAAGTGGTAA GGGCCCAGAC ATCATGTTTG GTTTACCACA CGATAACTTG GGTACTTTCC AAAAGGCAGG ACTTTTAGCT GAAGTACCAG ACGGAGTTAT AAACAAAGAC GACTATGTTC CAATGAGCAT AAGTGCTGTA TCTTACGATG GAAAAATGTA TGCTGTACCA ATTTCAATGG AGACTTATGC ACTTTTCTAC AATACAGATA AAGTTCCAAC ACCACCAGCA ACACTTGATG ATTTAATTAA GCTGGGCAAA GAAGTAGGAT TCCAATACGA TGTAAATAAC TTCTATTTTA GCTTTGCCTT TATATCTGCT TATGGCGGTT ATGTGTTCAA GGATACAGGC GGTGGACTTG ATCCAAACGA TATAGGATTG AATAACGATG GTGCAAAGAA AGGCTTAGAA CTTATAAAAG ACTTTGTGAC AAAGTATAAA TTCATGCCTG CAGATATAAA TGGAGATATG GCAAAAGGTA ACTTCCAAAG TGGAAAGACT GGTTTGTATA TAAGTGGTCC ATGGGATGTG GATGGCTTCA AGAAAGCTAA TGTTCCATTC AAAGTTGCTC CACTACCACA GGTTGATGGT AAACCAATGC CTTCCTTTGC AGGTGTGCAA GCTGCTTTTG TAAGTGCAAA TTCAAAACAT CAACAAGAAG CATGGGATTT AATGAAGTAT CTTGCTGAAA ATACAGGATT GCCATTATTT GAAACAGGTA ACAGGATACC AGCTCTTAAA TCCCTCTTAG ACAATCCAGA AGTTAAGAAT AATGAGATTT TGAATGCATT TGCTGAGCAA GCTACACATG CTATTCCAAT GCCTAATATA CCGCAAATGG CAGCAGTCTG GACTCCTGCA GGCAATGCAT TACAGCTTAT TACATCTGGA AAAGTTCCGG TTGACAAAGC TGCTGACGAT ATGGTGAATC AAATCAAGCA AGGCATTGCA ACACAGCAAT AA
|
Protein sequence | MKKYTKIIAL LTVMVFVLSA ALTGCGGSKT SENTSSGQTE QKKEPVELIV WSHLTDPEIA KVQEIANKWA EQTGNKVKVL ADQSDFQAFS TAAQSGKGPD IMFGLPHDNL GTFQKAGLLA EVPDGVINKD DYVPMSISAV SYDGKMYAVP ISMETYALFY NTDKVPTPPA TLDDLIKLGK EVGFQYDVNN FYFSFAFISA YGGYVFKDTG GGLDPNDIGL NNDGAKKGLE LIKDFVTKYK FMPADINGDM AKGNFQSGKT GLYISGPWDV DGFKKANVPF KVAPLPQVDG KPMPSFAGVQ AAFVSANSKH QQEAWDLMKY LAENTGLPLF ETGNRIPALK SLLDNPEVKN NEILNAFAEQ ATHAIPMPNI PQMAAVWTPA GNALQLITSG KVPVDKAADD MVNQIKQGIA TQQ
|
| |
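As a rough illustration of how the Sequence fields relate to the GC content, Translation table, and Protein sequence fields, the sketch below strips the 10-base block spacing, computes GC percentage, and translates a coding sequence. It is a minimal example under stated assumptions: the helper names are hypothetical, the codon assignments are those of the standard code (which translation table 11 shares for all 64 codons, differing only in the permitted start codons), and ambiguous bases such as N are not handled.

```python
# Build the 64-codon lookup in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing non-ACGT characters.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGAAGAAGT ACACAAAAAT AATTGCATTG"))  # MKKYTKIIAL
print(translate("ACACAGCAAT AA"))                     # TQQ
```

The two printed fragments match the start (MKKYTKIIAL) and end (TQQ) of the protein sequence in the record. Passing the full gene sequence to `gc_percent` should allow the GC content field to be checked in the same way.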