Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tter_2749 |
Symbol | |
ID | 8640778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobaculum terrenum ATCC BAA-798 |
Kingdom | Bacteria |
Replicon accession | NC_013526 |
Strand | - |
Start bp | 936247 |
End bp | 938184 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003324458 |
Protein GI | 269839765 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.655087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAACG CAGACGAAGT CCGAAGTCCC AATGATCCCA GGCTGACCCG CCGGCAGCTC ATAGAAGCGA TGCTGGCCTC CGGTGGCGTG CTGGCCGCTG GCCCTCTGCT GGCCGCCTGC GGTGGGGGCA CCCAGGCTAC CCCGACACCC CCAAGTGGTC AAGCTCCTGC AGCGAGCGCC CCGCCTGCTT CGGCCAGCAG CCCGAGCCCA AGCCCGACCG CCTGCGCTAC CTGTCCCTCC CCTGGAGCCT CCCCAGGCGT GACTCCCACC GCTGGGGAGA AGCAGGCTGA GGAGGTGCCG CCCCAGCAGC CACAGAGGGG AGGCACCATT CCTACCCCCA GGGAGCAGAC GGTCGTCGTA GACCAGGCTC CATTCACCAT CTTCGACAGC TTCAACCCAT TCATTCCCAA CGGTTGGCAG TACAACGGAG GGGCGCAGCA GATCCTCGTG GAAAACCTCT TCTACTGGTT GCCTACTGGC GAGATCAAGC CCTGGCTGGG CAAGGAGTGG AAGTACAACT CCGACTACTC CGAGCTGACC CTTACCCTCA ACCCCAAGGT GAAGTGGAAC GATGGTCAAC CCTTCACATC GGAGGACGTC AAATACAGCC TGGAGATCCA GCAGGATCCG GCGTTATTCG GTAGCGGCAT CAAGGAGTTT GATAGCGTTG AGACTCCTGA CGAGCATACA GTGGTGATCA AGCTCACCAA GCCGAATCCT CGCTACCACC AGCGTTTCAT CTGTGGTATT ATCGGCTCCT TCATCGTGGT GCCCAAGCAT ATATGGTTCA AGCACAATCC CAAGACCTTC AAGAACAACC CGCCGGTCTT CACTGGCCCC TACGTTCTGG ACAGGGCTAT ACCATCGCAG CTGATGTTCG TGTGGAAGAA GAACCCCAAC TACTGGAACA AGGACGAGTT TGATCCCGCT CCTCAGTACG TGGTGTACCG CACAGCGCCC GTGCCGGACT CGGAGGTCCA GGAGTTCAAG CGGGGCACGG TGGATGTGGC TTCGTTCGAC TACACGCACG CGATGGCCGT CAAGAACGGC GGCTACCAGA ACATCATCAT TACCACGAAG TTCAGGGATC CATGCCCGCG CGGCATTCAC GTCAACTGCG ATCCCTCCAA GCCGCTGCTG TCGGACCCGC GGGCGCGCTG GGCAATCTCC TACCTCATAG ACAGGCAGAA GATAGGTACG GCCGTGTGGT TCGTGCCCAC CCCGCCCGCG CAGTATCCTT GGGCGGACTA CGAGTCCAAC AAGAAGTGGT CCAACGACTC TATAGCTTCC AAGTATCAGC TGACCTACGA TCCCAAGAAG GCGGAGCAGC TGTTCGATGA GCTGGGTGCG AAGAAGGACT CCAGCGGCAG GCGTATGTAC AAGGGCAAGC AGGTAGAGAT CAACATCATC ACCCCGGTAG CCGTAGGGCA GCCCGAGTAC GTGATCGGCC AACTGCTCTC GCAGGAGCTA CAGAAGATCG GGATCAAGTC GAGCGTGAGG GCGCTCTCTG GCAGCGTGTT CAGTGACAAG CAGAGCAAGG GACAGTACGA CCTGCTCTCG CTATGGATCT GCGGAGAGCT GTTTGATCCC GCCCAGTTCT ACTCCCAGTT CGACAACCGA CAGTATAAGC CGATCGGCAA GGATGCCTCT TCGGATCAGG TGAGGCTCAA GGACCAGAAG TTTCAGGAGC TGATCCTCAA GCTCAACAAC GCTAATCCCG ACGATCCGAA GAACAAGCCG CTATACGACC AGGCGCTGGA GCGGTTCTAC CAGCTCCTGC CGGTCATCCC GATCATCCAG ACCACTTATC CGAAGGTGTA CAACACCACC TACTGGACAG GCTGGCCCAC CGATGACAAC GTGTACACGG TGCCCCTGGA GTGGTGGGGC CAGTTCATGT TCGTGATCGG CAACCTGAAG CCGGCCAAGA GTTCGTAG
|
Protein sequence | MSNADEVRSP NDPRLTRRQL IEAMLASGGV LAAGPLLAAC GGGTQATPTP PSGQAPAASA PPASASSPSP SPTACATCPS PGASPGVTPT AGEKQAEEVP PQQPQRGGTI PTPREQTVVV DQAPFTIFDS FNPFIPNGWQ YNGGAQQILV ENLFYWLPTG EIKPWLGKEW KYNSDYSELT LTLNPKVKWN DGQPFTSEDV KYSLEIQQDP ALFGSGIKEF DSVETPDEHT VVIKLTKPNP RYHQRFICGI IGSFIVVPKH IWFKHNPKTF KNNPPVFTGP YVLDRAIPSQ LMFVWKKNPN YWNKDEFDPA PQYVVYRTAP VPDSEVQEFK RGTVDVASFD YTHAMAVKNG GYQNIIITTK FRDPCPRGIH VNCDPSKPLL SDPRARWAIS YLIDRQKIGT AVWFVPTPPA QYPWADYESN KKWSNDSIAS KYQLTYDPKK AEQLFDELGA KKDSSGRRMY KGKQVEINII TPVAVGQPEY VIGQLLSQEL QKIGIKSSVR ALSGSVFSDK QSKGQYDLLS LWICGELFDP AQFYSQFDNR QYKPIGKDAS SDQVRLKDQK FQELILKLNN ANPDDPKNKP LYDQALERFY QLLPVIPIIQ TTYPKVYNTT YWTGWPTDDN VYTVPLEWWG QFMFVIGNLK PAKSS
|
| |