Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1550 |
Symbol | |
ID | 4600843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1498573 |
End bp | 1499859 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774324 |
Product | extracellular solute-binding protein |
Protein accession | YP_920949 |
Protein GI | 119720454 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCAGA AAAAGGTACT GGTAGCAGCG GTAGTAGCAC TAGTTGTCCT GGTAGCCCTC GCCGCCTACG TTCTCTACCG GCCGAAGCCG AAGTCTGTCA CCCTAACGTT CGTCTCGACG CAGCTCAGCC CTCCCACCGA GCAAGCTTTC ATGAGGTCTC TACTATCCCG GTTCGGGAAC GAGACGGGCA TAAAGGTCGA CTTCGTGCCG CTAGGCTACA CCGACATGGT GGCGAAGGTA GAAGCAGAGG TCAACTCCGG GAAGGTCTCG ACCAACATAA TCGGAGGTCT CTCCTCGGAG GTGGACTACT TCGCCAGCAA GGGGCTTGTA GAGGATCTTT CTAAGTTCGG CTCCCTACCC GGCAGGACGT TCTACCCAGC GCTCGAGCAG GCCTCGAAGA TGTACGGCAT TAAGGCCATG GTTCCCTGGA TGACCGCGAC CTTCGTTATC GTGGTGAACA ACAAGGCCTT CGACTACCTG CCGCCGGGCC TTACGAAGGA CGACGTGATA AAGGGAACCG ACAAGTGGAC GTACGACGCG CTCCTAGCGT GGGCCAAGAA CATCTACGAG AAGACCGGGA AGCAGGCAGT AGGGTTACCG GCGGGGGGCG GAGGGCTCCT CCATAGGTTC CTCCACGGCT ACCTCTACCC GTCCTACACG GGGTACCAGG CGAAGGCGTT CAACTCCCCG GAGGCCGTGG AGCTTTGGAA GTACCTGAGG GAGCTCTGGA AGTACACGAA CCCTGCCAGC ACGACGTACG ACGCAATGGC GGACCCCTTG CTCAAAGGTG ACGTCTGGAT CGCGTGGGAC CACGTCGCCC GGGTGAAAGC CGCTATAACG ACCTCGCCCG ACCAGTTCAC GGTGTGCCCC GTGCCGCGGG GACCCAAAGG GAGAGGCTAC ATAGTGGTTC TAGCAGGGCT CGCTATACCC AAGGGAGCCC CCGACCAGGA CTCGGCGTGG AAGCTCGTAG AGTTCCTGAC GAGGCCAGAG GTTCAGGCCA AGGTGGCCGA GAACGTAGGC TTCTTCCCGA CAGTTAAGGA GGCAAGCCCG GCGATAACGG GTCCCATAAA GAAGCTGGCT GACGGTGTGT CCGCTCAGAT GGCGGCCCCA GACTCCATAG CGGTTATGAT ACCGCCGCTC GGCGCCAAGG CTGGAAGCTT CAACTCCGTG TACCGCGATG CCTTCACGAG GATCGTGCTC AAAGGAGAGG ACATCCAGGC GGTTCTAGCC GACGACGCCG GAAAGCTTGA CAGCATATTC AAGGAGCTGA AGATACCGCC TCCCTAA
|
Protein sequence | MPQKKVLVAA VVALVVLVAL AAYVLYRPKP KSVTLTFVST QLSPPTEQAF MRSLLSRFGN ETGIKVDFVP LGYTDMVAKV EAEVNSGKVS TNIIGGLSSE VDYFASKGLV EDLSKFGSLP GRTFYPALEQ ASKMYGIKAM VPWMTATFVI VVNNKAFDYL PPGLTKDDVI KGTDKWTYDA LLAWAKNIYE KTGKQAVGLP AGGGGLLHRF LHGYLYPSYT GYQAKAFNSP EAVELWKYLR ELWKYTNPAS TTYDAMADPL LKGDVWIAWD HVARVKAAIT TSPDQFTVCP VPRGPKGRGY IVVLAGLAIP KGAPDQDSAW KLVEFLTRPE VQAKVAENVG FFPTVKEASP AITGPIKKLA DGVSAQMAAP DSIAVMIPPL GAKAGSFNSV YRDAFTRIVL KGEDIQAVLA DDAGKLDSIF KELKIPPP
|
| |