Gene Tpen_0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0145 
Symbol 
ID4600637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp120098 
End bp121093 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content58% 
IMG OID639772899 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_919558 
Protein GI119719063 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCGAGG ATAAGGTTAT ACTGAAGGCG GAAAACCTGA AGAAGTGGTT CACTGTTCGG 
AGAGGTCTCT TCGGAGGCAC AGTGGAAGTA AAGGCGCTTG ACGGAGTTTC CTTCGAGTTG
AGGGCAGGCG AAGCCGTATC GCTCGTCGGC GAGTCGGGTA GCGGGAAGAC GACGCTGGGC
AAGACTATCC TCCGCCTCTA CGAGCCCACG GACGGTAAGC TCGTGTTCAA AGGTAGGGAC
ATAACGCACA CGCCCGAGAA GGAGCTAATG TGGTACAAGA GGGAGACCGG GCTCGTGCAG
CAGGATCCAT ACGGGGCCAT GCCGTCCTTC ATGAACATCT ACCGCATCCT CGAGGAACCC
CTAATCATAC ACAAGGTCGG GAGCAAGGAG GAGAGGGCTG AAATGGTTTT CAAGGCCCTC
GAAGAGGTAA GGCTCACCCC GGTAGAAGAT TTTGCGTACA AGTACCCGCA CATGCTTAGT
GGCGGCCAGC TCCAGAGAGT AGCGATAGCG AGGGCGCTCA TACTCAAGCC GAGCCTTGTC
GTAGCCGATG AGCCGGTATC GATGCTCGAC GCCTCCGTAA GGGTTGAGAT CCTCACGTTG
ATGAGGGACC TGCAAGAGAA GAGGAATATT AGCTTCATCT ACATCACGCA CGACCTATCG
ACGACGAGGT ACTTCAGCGA GTGGATCTTC ATAATGTACG CCGGCCACAT AGTGGAGAGA
GCCCCAACGA AGACCCTACT CAGGAACCCG TTGCACCCGT ACACGCGCGC CTTGCTCTCG
GCAATACCCG ACCCAGACCC CGAGAATAGG AAGAGGTACA GGGAGGTCCC GCCAGGAGAG
CCGCCGAGCC TCGTGAACCC GCCGCCCGGG TGCAGGTTTG CGCCCAGGTG CCCCTTCGCG
ACGAGCAGGT GCAGGAGCGA GGACCCGCCC GAAGTGGAGG TCGAGCCAGG CCACTACGTT
AAGTGTTGGC TCTTCGCCGG AGAAGCGAAG GCTTAG
 
Protein sequence
MSEDKVILKA ENLKKWFTVR RGLFGGTVEV KALDGVSFEL RAGEAVSLVG ESGSGKTTLG 
KTILRLYEPT DGKLVFKGRD ITHTPEKELM WYKRETGLVQ QDPYGAMPSF MNIYRILEEP
LIIHKVGSKE ERAEMVFKAL EEVRLTPVED FAYKYPHMLS GGQLQRVAIA RALILKPSLV
VADEPVSMLD ASVRVEILTL MRDLQEKRNI SFIYITHDLS TTRYFSEWIF IMYAGHIVER
APTKTLLRNP LHPYTRALLS AIPDPDPENR KRYREVPPGE PPSLVNPPPG CRFAPRCPFA
TSRCRSEDPP EVEVEPGHYV KCWLFAGEAK A