Gene Tpen_1174 details


Gene Information

Locus tag: Tpen_1174
Symbol:
ID: 4602040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1115043
End bp: 1116551
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 56%
IMG OID: 639773950
Product: extracellular solute-binding protein
Protein accession: YP_920575
Protein GI: 119720080
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
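
The length fields in this record agree with its coordinates. The following is a minimal sketch of that arithmetic in Python, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Inclusive length of a feature, assuming 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(1115043, 1116551)
print(length, protein_length_aa(length))  # prints: 1509 502
```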


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA AAAAAGGCCT GCAGAAGACC ACAGCGATAC TGCTCGTAGT AGTCCTGCTG 
GTAGGGCTAC TAGCCGGCTA CTTCATAGGA GTTTCCACTG CGCCGAAAGC CCCCGCGGAG
GAGGTAGTGC CCAAATCCCA GTACGAGCAG CTACAGAAGG AGCTCGAGTC CGTAAAGGCT
CAGCTACAGC AGATGGCCGC GCAGCAGGGC AAGCCTGTCG AGATAGTAAT CACCGCGTGG
ACTCAGGGGC CCGAAAGGGA GTCGATATAC AGGCAGCTGA ACCTCGTAGA AGCGGCTAAC
AGGCTGAACC AGATATTCAA GGTGGTGGGC GTCCCGGCGA CTGTCAAGGT TGAGGGAGAC
TTCTCCACGG CGTCTTGGAC GGATTACAGG AAGAAGGTAT TCCTGGCGCT TGAAGGCGGG
ACGGGGCCCT GCATATTCCA GATGGAGCAC GTGTGGTCTG CGGTTCTCGC TGAGAACGGG
TGGATAATCC CGCTCGACGA CTACGTGAAG AAGTACTGGA ACTGGACGTA CTATGACATC
ATCCCCGGCC TATGGTCCTC CGTGACCTAC AAGGGGAAGA TATGGGGCAT TCCGCAGGAC
ACGGAGGCCA GGCCGATCTA CTTCAACAAG CTCCTCCTCA AGAAGCTCGG GTGGACCGAC
GAGCAGATAA ACGCACTCCC TGAGAAGATC AGAAGGGGCG AGTTCACGCT CCAGGACATG
TTGATGGTAG CGAAGGAGGC TGTCGACAAG GGAGTCGTGG CGCCTGGCTA CGGCATCTGG
CACCGCCCGA CTGCTGGCCC TGACTGGCCC ATAGTATACC TGGCATTCGG AGGCAAGCTT
TACGACGAGA CCAGCGGGAA GCTAGTAGCG GACATGAAGG TTTGGAAGAA GGTCTTCGAC
TGGTTCTACG CGGCCTCGAT GCAGAAGTAT AAGGTGATAA CGGATAAGAT GACGTCGCTC
GACTGGAACA GGGACGTTCA CCCAACAATA GTAGCCGGTA AAGTATTGTT CTGGATGGGA
GGAACGTGGC ACAAGGGGCA GTGGGTCGGC TCCTTCAACC TCTCCGAGAG CAAGTTCTGG
GAAATGTTCG GCTTCGCCCT CTACCCCGCA GGCGAGCCGG GACTTAAGCC TGTCACCCTC
TCGCAACCAC AGGCTTACTT CATCTCGAAG ACATGCAAGT ACCCGGAGAT CGCGTTCCTC
ATAATAACCC TGGCTACCGA CCCCTACCTC AACTCGTTGC ACGCCGTTAA AAGCGCACAC
CTAGCGATAA TGTACCGGCA GCTGTCCGAC CCTGTATACA CGAAGGACAA GTTCCTCGCT
ATGACGGGCT ACATGGTAGA GTACGCGCAG TACCAGCCGA TGCACCCGAG ATGGGGAGAC
TACAACACGA TAATATTCAA TACGATAAAG GGTATCGAGA CAGGGCAGTT CGACGCCGAC
CAGGCTCTGC AGGTCTTCAA GCAGAACCTC CAGTCCACGC TTGGCGATAA CGTAATAATA
AAAGAATAA
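
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before it is measured. The sketch below, in Python, shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input; passing the full sequence text works the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAGCCAGA AAAAAGGCCT GCAGAAGACC ACAGCGATAC TGCTCGTAGT AGTCCTGCTG"
print(len(clean_sequence(first_row)))  # prints: 60
print(round(gc_percent(first_row), 1))
```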
 
Protein sequence
MSQKKGLQKT TAILLVVVLL VGLLAGYFIG VSTAPKAPAE EVVPKSQYEQ LQKELESVKA 
QLQQMAAQQG KPVEIVITAW TQGPERESIY RQLNLVEAAN RLNQIFKVVG VPATVKVEGD
FSTASWTDYR KKVFLALEGG TGPCIFQMEH VWSAVLAENG WIIPLDDYVK KYWNWTYYDI
IPGLWSSVTY KGKIWGIPQD TEARPIYFNK LLLKKLGWTD EQINALPEKI RRGEFTLQDM
LMVAKEAVDK GVVAPGYGIW HRPTAGPDWP IVYLAFGGKL YDETSGKLVA DMKVWKKVFD
WFYAASMQKY KVITDKMTSL DWNRDVHPTI VAGKVLFWMG GTWHKGQWVG SFNLSESKFW
EMFGFALYPA GEPGLKPVTL SQPQAYFISK TCKYPEIAFL IITLATDPYL NSLHAVKSAH
LAIMYRQLSD PVYTKDKFLA MTGYMVEYAQ YQPMHPRWGD YNTIIFNTIK GIETGQFDAD
QALQVFKQNL QSTLGDNVII KE
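
The protein sequence can be checked against the gene sequence by translating codons in frame from the first base. The sketch below is a minimal Python illustration, not the database's own method. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which start codons it permits). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Whitespace is ignored. Codons with characters other than A, C, G, T
    raise KeyError, and a trailing partial codon is dropped.
    """
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row and last nine bases of the gene sequence above.
first_row = "ATGAGCCAGA AAAAAGGCCT GCAGAAGACC ACAGCGATAC TGCTCGTAGT AGTCCTGCTG"
print(translate(first_row))    # prints: MSQKKGLQKTTAILLVVVLL
print(translate("AAAGAATAA"))  # prints: KE
```

Both outputs match the start (MSQKKGLQKT TAILLVVVLL) and the end (KE) of the protein sequence listed above.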