Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_2152 |
Symbol | |
ID | 5878089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2155188 |
End bp | 2156771 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641542507 |
Product | extracellular solute-binding protein |
Protein accession | YP_001663760 |
Protein GI | 167040775 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000101547 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAT TTATTATATT ATCTTTCATT TTGATGTTTA TTTTTACAGG ATGCAAAACC ACTCCACAAA ATATGGTAAA TTTGCCGCAA GCTTCTGAGG ATAAAACTAA GATTGAAGAA GAAAAGCCAC TAGAGGGTGG CACTTTAAGA GTAAATATTA CTTCTTTTGA TACGTTAAAT CCTTTTTTAA ACAGCAATGA ACGAGTAAGG CAAATGTTGA ATTTGTCATT GGAAGGATTA GTTACTTTAG ATAAAACTTT AAAACCTGTT CCTCAATTAG CAGAAAAGTG GGAGATAAAT GGACTTAATA TAAAATTTTA TTTAAAAAAG AATGTTAAAT GGCAGGATGG AGTAAGTTTT ACAGCTAATG ATGTAAAATT TACTTTTGAT TCTTTTAAAA GTAAGGAGGT AAAAAGCCCT TATAAGGACA TTCTCATTAA CTATCTCTCC TCTTATAAGA CAAACGGTGA TTATGAATTT GAAGTTGTTT TAAATAAACC TGCTGCAAAT CCAATAGCTT TATTTACTTT TCCTATTTTA GCCGAGCACC AATACAAGAG CAAGGAAGAT ATTTTAAATA AAGACATCGT ACCTATTGGT ACAGGACCAT ATAAGATATC TTCTTACAGT GTTAGCAGAG AGGTTATTTT TGAAAAAAAT CCTTATTATA GAGGCGACAA ACCCTATATA GATAATATAG TATTTAAAAT TGTGCCTAAT GAAAATGCCA TGATAACATC TTTTCAAAGC AAAGAAGCAG ATTTTACTTT TTTAAGTGAC ATAGACTGGG ATAAATACAA AGAGCTTTCA AATGTAAATA TTTATAAATA CGTTATGCAA GATTATGTAT TTATGGCTCC TAATTATCAA AATCCAGCGT TGAAAGATTC AAATGTCAGG AAAGCGATGT GTTATGGAAT AGATACTGAC AAAATTTTGA GAGATGTATA TTTTGGTCAT GGGTTAAAGT CATGCATCCC AGTCCGCCCT GATTCATGGC TTTATAGTAG TAAAATAGTA GCTCATAATT ACGATATAAA AGAGGCAAAT AAAATATTAG AGGAAAATGG TTGGAATTTA GTTAAAGGGA TAAGGAACAA TGGAACTTAT CAACTTAAAT TTGATTTGAT AGTAAATGTA AATAATCCTT ATTTAATTAA GACTGCTCAA ATTATAAAGA ATAATTTAAA AAGCATAGGA ATTGAAATAA AAATTGTACC AAAAGATTGG GATAATTTAC TAAGTTCTGT GTATTCGGGT AAGTTTGATT TGGTGCTAAT GGAGTGGAAT TTAAGTTATA ATCAAGATAT GTCTGCGATG TTTATGACAA AAGGCAAAGA TAATTTTATG GGTTATAGTA ATTCTAAAGT TGATGAAATA TATAGTAGAA TTTTTTATGA TTTAGAAGAA AATTCTTTAA AAGCAGATTA TCAAGTTTTG GAACAAGTTT TTTTAGAGGA ACAACCTATA ATTGGCCTTT TCTATATTGA AGGAGCAGTT ATGGCATATG ACAATGTAAA AGGTGTGGAC CCTACTGGAT TTAATGTTTT CGACAATATA GAAAAATGGT ATATAAAAAA GTAG
|
Protein sequence | MRKFIILSFI LMFIFTGCKT TPQNMVNLPQ ASEDKTKIEE EKPLEGGTLR VNITSFDTLN PFLNSNERVR QMLNLSLEGL VTLDKTLKPV PQLAEKWEIN GLNIKFYLKK NVKWQDGVSF TANDVKFTFD SFKSKEVKSP YKDILINYLS SYKTNGDYEF EVVLNKPAAN PIALFTFPIL AEHQYKSKED ILNKDIVPIG TGPYKISSYS VSREVIFEKN PYYRGDKPYI DNIVFKIVPN ENAMITSFQS KEADFTFLSD IDWDKYKELS NVNIYKYVMQ DYVFMAPNYQ NPALKDSNVR KAMCYGIDTD KILRDVYFGH GLKSCIPVRP DSWLYSSKIV AHNYDIKEAN KILEENGWNL VKGIRNNGTY QLKFDLIVNV NNPYLIKTAQ IIKNNLKSIG IEIKIVPKDW DNLLSSVYSG KFDLVLMEWN LSYNQDMSAM FMTKGKDNFM GYSNSKVDEI YSRIFYDLEE NSLKADYQVL EQVFLEEQPI IGLFYIEGAV MAYDNVKGVD PTGFNVFDNI EKWYIKK
|
| |