Gene Tpen_1658 details

Gene Information

Locus tag: Tpen_1658
Symbol:
ID: 4601735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1604310
End bp: 1605644
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 60%
IMG OID: 639774431
Product: rhomboid family protein
Protein accession: YP_921056
Protein GI: 119720561
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
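
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1604310, 1605644)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.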


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.598011
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTTGC CGTTTCAGCG AGACTACGAG GACATAGAGA GGCGTGGAGC CCCGGTCACG 
CTTGGAATAA TACTCGTAAA CGTGGCGGTC TACCTCGTGT CGAGCTACGA GAACGGCTTC
CTCGCGGTCT CCGACGCCTG GGTTAACGCG TTCGCGTTTG TCCCAGCGTA CTTCGCGAGG
CCCGAGCACC TCTACAGGCT CTTCACTTCG ATGTTCCTGC ACGCCAACCT GGCGCACATA
TTCTTCAACA TGCTTTACCT CTACACGTTC GGGAAGAGCG TCGAGGCAGT CCTCGGTAGC
GAGAGGTACT TCCTGCTTTA CTTCGCGAGC GGTATCCTTG CGAGCGTCTT CCACACGGCG
TTCCTACCCA TAGAGGGCGC GAGCTCGGCT TTCGTGCCGG CCCTCGGTGC CAGCGGTGCT
ATAAGCGGGG TTCTAGGCGC GTACCTCCTG CTGTTCCCGG GGACCAGGCT CACGATGTGC
TTCTTTTACG TATTCATCCC GCTATGCTTC ACCATGAAAG CGGCGGCTTA CCTCGTGTTC
TGGTTCGCGC TACAGATACT GCAGGGGTTC CTGGGGGCTA GCCTCGGCGT CGCGGTATTC
GCGCACGCAG GGGGGTTTAT AGGCGGGCTC GCGCTACTTC CGGTGCTCGT CAGCGAGGAG
AGGATAGGGC TGCTCAGGCT GTACTCTTCG ATGCACTACT TCTTCAGGAA CATATTCTTC
ACTGAGAGGG GCTTCACGAG GCTCAGCAAG GCAGTCGTAG CGGTGCTGAT AGGGCTCGTC
GCGGCCGCGG CAGTCTACTC GGCTGTAGAG GCGGAGACCA CGGGGGGTGT GAACAAGGTG
TTAACGGTGA GCGTCACGGG TAGAGGCGTC AACGATTCTG AAAGCGTGAT AATCCAGCTC
CAGCCGGGCG GAGTCCTGGA CGTCACCCCC ATATCGAGTA GCGGCGTGAG GGTCGTTGTA
AACCGCCTCA GGGCTGCCAA CCTCCTCTAC AACAGCCAGG CCGCCGGAAA GAGTATCTCC
GTGGATAAAA GCCTGAGGGG AGTCGTCAAC GGCTTACCCG TCGAGATAAC GATAAAGGCT
AACCTCTCGT TCGACTCCTA CGGGTTGCTA GACAGGGGGC AGGGCCACGT GACCACCGAC
GTCCTCGCCT GCGATGCCTA CGGCAGGTGC ACCGTTTCGG GTAAGGGTAG CTACGACTTC
TCCGTGAGCA GCGAGGCCAC GATGGCGGGC TTCAAGGGGA TACCCATAGC GGAGCTAGCC
CTTCTCTCGC TGGCTTCGAG CCTGGCCGCC GTGGTCAACG TTCTTAGGGC GGAGAGGTAC
GCGATAGTAG AGTAG
 
Protein sequence
MGLPFQRDYE DIERRGAPVT LGIILVNVAV YLVSSYENGF LAVSDAWVNA FAFVPAYFAR 
PEHLYRLFTS MFLHANLAHI FFNMLYLYTF GKSVEAVLGS ERYFLLYFAS GILASVFHTA
FLPIEGASSA FVPALGASGA ISGVLGAYLL LFPGTRLTMC FFYVFIPLCF TMKAAAYLVF
WFALQILQGF LGASLGVAVF AHAGGFIGGL ALLPVLVSEE RIGLLRLYSS MHYFFRNIFF
TERGFTRLSK AVVAVLIGLV AAAAVYSAVE AETTGGVNKV LTVSVTGRGV NDSESVIIQL
QPGGVLDVTP ISSSGVRVVV NRLRAANLLY NSQAAGKSIS VDKSLRGVVN GLPVEITIKA
NLSFDSYGLL DRGQGHVTTD VLACDAYGRC TVSGKGSYDF SVSSEATMAG FKGIPIAELA
LLSLASSLAA VVNVLRAERY AIVE
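
The protein sequence can be related to the gene sequence by removing the layout whitespace (the 10-character blocks and line breaks) and translating codon by codon. The sketch below is an illustration under stated assumptions, not the database's own pipeline. The helper names are hypothetical, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons). It only handles upper- or lower-case A, C, G, T; any other symbol raises a KeyError.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks used for layout; upper-case the rest."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as listed in the record.
head = clean_sequence("ATGGGTTTGC CGTTTCAGCG AGACTACGAG")
tail = clean_sequence("GCGATAGTAG AGTAG")
print(translate(head))  # MGLPFQRDYE
print(translate(tail))  # AIVE
```

The two fragments translate to the first ten and last four residues of the listed protein. Passing the full cleaned gene sequence to `translate` and `gc_content` would be the way to compare against the Protein Length and GC content fields.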