Gene Tpen_1638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1638 
Symbol 
ID4600917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1584637 
End bp1586226 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content58% 
IMG OID639774411 
Productextracellular solute-binding protein 
Protein accessionYP_921036 
Protein GI119720541 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.343389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGTAC CCAAACTAGG TAAAAAGGCT GTAGCGGTAG CGGTCGTCGC TCTGGTAGCG 
GTCGTCGCCG CCTACGTCGC GCTGGCGCCC AAGCCTCCGA AGTACAAGAA CGTGATAGTG
ATAGGTACCA CGGACTCCGT GCAGACAACG CTCGACCCGT GCGAGGCCTA CGACTACCTC
GGCGTAAACA TAATCCAGAA CATGGGTGAA GGTCTCTTCG GCTACGAGCC CGGCACAGCC
AAGATAGTGA ACAAGCTCGC CGTGAGCTAT ACGATCAGCC CGGACAGGAA GGTCTGGGAG
ATAAAGATCA GGAGGGACGC GAAGTTCCAG GACGGAACCC CCCTCACCGC GCAAGCCGTG
GTCTGGTCCT GGGAGAGGAC GATAAAGCTG AACCAGGACC CAGCTTTCCT GATAGCCGAC
CTCATAGAGA AGGCAGAGGC GGTGTCCGAC GACACTATAC GCGTGTACCT GAAACAACCC
TTCGAGGAGA CCTACGTGAA GTCGCTGCTA GCTACGTGGG TAGCCTTCCC CGTGAACCCG
AAAACCACGC CGATGGAGGT GGTCAAGCCT CCCAACACCG TGGACATGAT AGGGCCGTAC
AAGCTCGCGA GCTGGAAGCC CGGCGAGTAC ATCGAGCTCG TAGCGAACCC GAACTACTAC
GGCGAGAAGC CTAAAACCGA GAGGATTATA ATCCGCTTCT ACAAGGACGC CCAGTCCCTC
AGGCTGGCCG TGGAGAACGG GGAGGTGGAC GTCGCGTTCA GGACGCTCTC GCCTGCGGAC
GTTAAGGACA TAATCGCCAA GGGCCAGCTC CAGGTCGTAA GCGGGCCGGG CATAGCCCCC
ATAAGGTACC TCGTCTTCAA CGTGAAGAAG GAGCCCTTCA ACGACAAGAG GGTGAGGCAG
GCCATAGCCT ACCTGATAGA CAGGAACCAG ATAGTGAACA CCGTGTTCAT GGGGACATTC
GCCCAGCCGC TCTACAGCAT GGTCCCCATA GGCTTCATAG GCCACAAGGA CAGCTTCAAG
GAGAGGTACG GCGACAAGCC TAACGTCGAG GCGGCGAGGC AGCTGTTAAG GCAGGCGGGC
TTCAGCGAGT CCAACCCGCT CAAAATGGAG CTCTGGTTCA CGCCGGTGAG GTACACGCCT
GCCGAGCCGG ACATAGCCGC CATAATAAAG CAGAACCTCG AGGCCAGCGG GATGATAAAG
GTAGAGCTGA AGAGCGCCGA CTGGGCTACG TACAGGTCTC TCTTCAAGCA GGAGAAGATA
GCCGTCTGGC TGCTCGGGTG GTTCCCGGAC TTCCTCGACT CGGACAACTA CGTGAGGCCT
TTCTACCACT CCGGGGCTAA CGGCTGGCTT CACGTGAACT ACAAGAACCC CGTCGTGGAC
GAGCTGATAG ACAAGCAGGT CTTGCAGCCG ACGGAGGAGA GGATTAAGAC CCTCGCGCAG
ATACAGGACA TAGTCGCCGA CGATGCGCCG ATAATCCCGC TCTGGCAGGA GGGTCAGTTC
GCTGTCGCAC AGAAGAACGT GAAGGGGATA GTGCTGGACT ACTCCCAGAT ATTCAGGTAC
TACCTCATAT ACGCCGAAGT GCAGGGCTAG
 
Protein sequence
MPVPKLGKKA VAVAVVALVA VVAAYVALAP KPPKYKNVIV IGTTDSVQTT LDPCEAYDYL 
GVNIIQNMGE GLFGYEPGTA KIVNKLAVSY TISPDRKVWE IKIRRDAKFQ DGTPLTAQAV
VWSWERTIKL NQDPAFLIAD LIEKAEAVSD DTIRVYLKQP FEETYVKSLL ATWVAFPVNP
KTTPMEVVKP PNTVDMIGPY KLASWKPGEY IELVANPNYY GEKPKTERII IRFYKDAQSL
RLAVENGEVD VAFRTLSPAD VKDIIAKGQL QVVSGPGIAP IRYLVFNVKK EPFNDKRVRQ
AIAYLIDRNQ IVNTVFMGTF AQPLYSMVPI GFIGHKDSFK ERYGDKPNVE AARQLLRQAG
FSESNPLKME LWFTPVRYTP AEPDIAAIIK QNLEASGMIK VELKSADWAT YRSLFKQEKI
AVWLLGWFPD FLDSDNYVRP FYHSGANGWL HVNYKNPVVD ELIDKQVLQP TEERIKTLAQ
IQDIVADDAP IIPLWQEGQF AVAQKNVKGI VLDYSQIFRY YLIYAEVQG