Gene Tpen_1055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1055 
Symbol 
ID4601441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp994302 
End bp995558 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content59% 
IMG OID639773833 
Productextracellular solute-binding protein 
Protein accessionYP_920458 
Protein GI119719963 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.404897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGCGC AGGAAAAGGG AAAGAAGGGA GTAAACAAGC TTCTTATCGC CGTTGCTGTA 
CTGGTTATCG TAGCACTAGC GGCTGTAGCC CTCTACCCGA TGTTCGCTCC TAAACCGGCG
CCTAAACAGG TAAAGATAAC TATATGGACG GCGTGGACTG GCGGAGAGTA CGACGCACTT
AAAGCCGTTA TAGACGATTT CAGGGCGAAG AACCCCAACT ACCAGATAGA CATAGTCAAC
GTCCCCTTCG ACCAGCTCAA GAACAAGGTT ATACAGGCGG TACCGGTCGG CGAAGGGCCG
GACCTCTTCA CCGGCCCGCA CGACTGGACG GGCGAGCTCG TCCAGGCGGG CGCTCTCGTA
GACATCACGG ACAAAGTCTC CGCCTTTAAG GGAGAGTACA TGGAGAGCGC CCTCCAGGGA
GTAACGCTGA AGGGCAAGAT CTACGGGCTA CCCGAGAGCA TCAAGCTACC GGCCTTGATA
GTGAACAAGA AGCTCCTAGC CACTCCCCCG AAGACGCTCG ACGAGCTGTG GAGCATAATG
GACCAGTTCA AGGCTAAGGG GATGTACGGC CTCGCGTACG ACGTGCAGAA CGCGTACTTC
AGTAGCTGCT GGTTCTACGG GCTCGGAGCC TACTACCTCG ACCCCAACAC CCTGGAGACA
GCGCTGGACA GCCCCGGCGC CGTCCAGGCG TTCCAGATAA TCGCCAAGTT CAGCAAGTAC
CTCCCGCCGG ACATCAGCTA CGACATGATG ACCAACCTCT TCATGAACGG TAAAGCCGCG
ATGGCCATAA ACGGTCCATG GTGGATCGGA GACCTCAAAA AGGCTTTCGG CGAGAACCTC
GCGGACATCG AGATAACCCT CATACCGGCT ATAGACCCGG CGCACCCGGC CAGGCCCTTC
ATGACGGTGG AGGCGGTCTT CGTGACTAAG AACGCCGCCG AGAGGGGCGT CCTCGACGAA
GCAATAGCGC TGGCGCACTA CATAACGGGC GAAGCCTCCG TAAAGCTCGC CAAGATGGCT
GGACACGTAC CCACGTGGAA GAACGCCATG AAGGACCCGG CAGTCTCCGG TGACAAGGTT
ATCAGCGCGT TCTTCAAGCA GGCCGAGTAC GGCGTACCGA TGCCGAACGT ACCCGAGGTC
GCGCAGATGT GGAACGTCGT GCCGAAGTAC ATAAGCCAGG TCTACCAGGG ACAGCTATCC
CCGCAGGACG CGGCGAAGGC GGCGGCCCAG GAGCTAAGAG CGGCCCTGAA GAAATGA
 
Protein sequence
MAAQEKGKKG VNKLLIAVAV LVIVALAAVA LYPMFAPKPA PKQVKITIWT AWTGGEYDAL 
KAVIDDFRAK NPNYQIDIVN VPFDQLKNKV IQAVPVGEGP DLFTGPHDWT GELVQAGALV
DITDKVSAFK GEYMESALQG VTLKGKIYGL PESIKLPALI VNKKLLATPP KTLDELWSIM
DQFKAKGMYG LAYDVQNAYF SSCWFYGLGA YYLDPNTLET ALDSPGAVQA FQIIAKFSKY
LPPDISYDMM TNLFMNGKAA MAINGPWWIG DLKKAFGENL ADIEITLIPA IDPAHPARPF
MTVEAVFVTK NAAERGVLDE AIALAHYITG EASVKLAKMA GHVPTWKNAM KDPAVSGDKV
ISAFFKQAEY GVPMPNVPEV AQMWNVVPKY ISQVYQGQLS PQDAAKAAAQ ELRAALKK