Gene Tpen_0248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0248 
Symbol 
ID4601475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp218299 
End bp219399 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content57% 
IMG OID639773002 
Productpeptidase M24 
Protein accessionYP_919661 
Protein GI119719166 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCGACT TTAAGCAGCA CGTGTCCAAA GTGGTTGAAA GGATCCTGGT ACCCAACGAT 
CTTAACTACC TAGTGGTGAT GTCGGCTTCT AACATCTTCT ACCTTTCGGG TAGCGACGCT
CCTTCCGCGC TTGTCGTGTC TAAAGAAGGA GAGGTCAGCG CGCTCGCCTC CCGCCTCGAG
TACTTCAGAG CAGTATCCGA GACAAGCGGC TTGCGCGTAG TGGCGTTTGC ACGCGAAGGG
GAAGACGTTA GCGAGTACGA GGAGGTTGTA CGGGGTGACT TCTACGAGGC GCTTTCACGA
ATGGTGTCCG GCAGCGAAAG GATCGGCGTT GTAGGGGCTT CCTGCGAGGC AAAGGAAAAG
CTGGCGGAGA AGACAGGGAA GCAGCTATAC GACTACTCCA AGGAGTTCTC CCTCATAAGG
CGCGTGAAAG ACCCCGGGGA GCTCGAAGCA ATAAACAGAG CTGCTCGGCT CGCAGAGCTG
GCTATGAGGA AGGCTCTAGA CACGCTGGAG CCAGGGGTCA CCGAGTCGGA GGTTGCCTCC
GAGATCCTGA AGGTCATAGT CTCCTCCGGT GCATATCCGT CGTTCCCACC CATAGTGGCC
TTCGGGGAGC ACGCGGCTCA CCCGCACGCG AAGCCTAGCC TGAGGAGGCT TATAAAAGGC
GACTTCGTAA AGATAGACCT GGGAGCTAAG GTTGACGGCT ACTGCTCGGA CATGACCAGA
ACCCTGGTCT TCGGCGAGCC GTCTGAGAAG CAGCGAAGAA TATTCGAGGC GGTGGTTAAA
GCTCAGGAAA GCGCGCTCGC CTCTATTAAG GCGGGCGTAC AAGCCCGGGA AGTACACGCA
ATAGCCCTCA GAGCCTTGAA GGAAGCGGGG CTTTCACAGT ACTTTAATCA CGGCCTGGGG
CACGGCGTCG GCGTCGATAT ACACGAGGAA CCGTACCTTA ACCTTCAGAG CGAAGCTGTG
CTCCTCGAAG GAGACGTAGT TACGGTTGAG CCGGGAGTCT ACCTGCCCGG CTACGGCGGA
GTACGCATAG AGGACATGGT GTACGTGGAG AGGGGCGGAG GACGCCTGCT GACATTCTTC
AGCAAAGACA TGGTGGTTTA G
 
Protein sequence
MIDFKQHVSK VVERILVPND LNYLVVMSAS NIFYLSGSDA PSALVVSKEG EVSALASRLE 
YFRAVSETSG LRVVAFAREG EDVSEYEEVV RGDFYEALSR MVSGSERIGV VGASCEAKEK
LAEKTGKQLY DYSKEFSLIR RVKDPGELEA INRAARLAEL AMRKALDTLE PGVTESEVAS
EILKVIVSSG AYPSFPPIVA FGEHAAHPHA KPSLRRLIKG DFVKIDLGAK VDGYCSDMTR
TLVFGEPSEK QRRIFEAVVK AQESALASIK AGVQAREVHA IALRALKEAG LSQYFNHGLG
HGVGVDIHEE PYLNLQSEAV LLEGDVVTVE PGVYLPGYGG VRIEDMVYVE RGGGRLLTFF
SKDMVV