Gene Tpen_1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1550 
Symbol 
ID4600843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1498573 
End bp1499859 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content60% 
IMG OID639774324 
Productextracellular solute-binding protein 
Protein accessionYP_920949 
Protein GI119720454 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCAGA AAAAGGTACT GGTAGCAGCG GTAGTAGCAC TAGTTGTCCT GGTAGCCCTC 
GCCGCCTACG TTCTCTACCG GCCGAAGCCG AAGTCTGTCA CCCTAACGTT CGTCTCGACG
CAGCTCAGCC CTCCCACCGA GCAAGCTTTC ATGAGGTCTC TACTATCCCG GTTCGGGAAC
GAGACGGGCA TAAAGGTCGA CTTCGTGCCG CTAGGCTACA CCGACATGGT GGCGAAGGTA
GAAGCAGAGG TCAACTCCGG GAAGGTCTCG ACCAACATAA TCGGAGGTCT CTCCTCGGAG
GTGGACTACT TCGCCAGCAA GGGGCTTGTA GAGGATCTTT CTAAGTTCGG CTCCCTACCC
GGCAGGACGT TCTACCCAGC GCTCGAGCAG GCCTCGAAGA TGTACGGCAT TAAGGCCATG
GTTCCCTGGA TGACCGCGAC CTTCGTTATC GTGGTGAACA ACAAGGCCTT CGACTACCTG
CCGCCGGGCC TTACGAAGGA CGACGTGATA AAGGGAACCG ACAAGTGGAC GTACGACGCG
CTCCTAGCGT GGGCCAAGAA CATCTACGAG AAGACCGGGA AGCAGGCAGT AGGGTTACCG
GCGGGGGGCG GAGGGCTCCT CCATAGGTTC CTCCACGGCT ACCTCTACCC GTCCTACACG
GGGTACCAGG CGAAGGCGTT CAACTCCCCG GAGGCCGTGG AGCTTTGGAA GTACCTGAGG
GAGCTCTGGA AGTACACGAA CCCTGCCAGC ACGACGTACG ACGCAATGGC GGACCCCTTG
CTCAAAGGTG ACGTCTGGAT CGCGTGGGAC CACGTCGCCC GGGTGAAAGC CGCTATAACG
ACCTCGCCCG ACCAGTTCAC GGTGTGCCCC GTGCCGCGGG GACCCAAAGG GAGAGGCTAC
ATAGTGGTTC TAGCAGGGCT CGCTATACCC AAGGGAGCCC CCGACCAGGA CTCGGCGTGG
AAGCTCGTAG AGTTCCTGAC GAGGCCAGAG GTTCAGGCCA AGGTGGCCGA GAACGTAGGC
TTCTTCCCGA CAGTTAAGGA GGCAAGCCCG GCGATAACGG GTCCCATAAA GAAGCTGGCT
GACGGTGTGT CCGCTCAGAT GGCGGCCCCA GACTCCATAG CGGTTATGAT ACCGCCGCTC
GGCGCCAAGG CTGGAAGCTT CAACTCCGTG TACCGCGATG CCTTCACGAG GATCGTGCTC
AAAGGAGAGG ACATCCAGGC GGTTCTAGCC GACGACGCCG GAAAGCTTGA CAGCATATTC
AAGGAGCTGA AGATACCGCC TCCCTAA
 
Protein sequence
MPQKKVLVAA VVALVVLVAL AAYVLYRPKP KSVTLTFVST QLSPPTEQAF MRSLLSRFGN 
ETGIKVDFVP LGYTDMVAKV EAEVNSGKVS TNIIGGLSSE VDYFASKGLV EDLSKFGSLP
GRTFYPALEQ ASKMYGIKAM VPWMTATFVI VVNNKAFDYL PPGLTKDDVI KGTDKWTYDA
LLAWAKNIYE KTGKQAVGLP AGGGGLLHRF LHGYLYPSYT GYQAKAFNSP EAVELWKYLR
ELWKYTNPAS TTYDAMADPL LKGDVWIAWD HVARVKAAIT TSPDQFTVCP VPRGPKGRGY
IVVLAGLAIP KGAPDQDSAW KLVEFLTRPE VQAKVAENVG FFPTVKEASP AITGPIKKLA
DGVSAQMAAP DSIAVMIPPL GAKAGSFNSV YRDAFTRIVL KGEDIQAVLA DDAGKLDSIF
KELKIPPP