Gene Tpen_0789 details

Gene Information

Locus tag: Tpen_0789
Symbol:
ID: 4601137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 741183
End bp: 742595
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 57%
IMG OID: 639773565
Product: hypothetical protein
Protein accession: YP_920194
Protein GI: 119719699
COG category:
COG ID:
TIGRFAM ID:
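
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and the coding sequence ends with a single untranslated stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, ASSUMING one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above
length_bp = gene_length(741183, 742595)
print(length_bp)                  # 1413
print(protein_length(length_bp))  # 470
```

Both results match the Gene Length and Protein Length values listed in the record.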


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.926627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGATA TGATGGGGTA TGAGGAAATC GAGAAGACTG TTAGGGAGGA GACGTGGAGG 
CGTGCGGCTC TAAGCTTGTA TGCCACCAGA ACAGAGGATA AGAGGAGGGG CAGGACGCAC
TACCGCGGCC TCTTCGACAC TGTGACCGAG GTCGACTGGG ACTTCACCAG GTTCCTCGTA
AACGGCTACA CGGTTGTGCC GGACGAGGCT TACCCCAGGT TCAGCAAGCT GTTCGACTAC
GACGCCCGCC AGTATCTCTT GTTGAACGAC GACGAGAGGC CGAGGGAGGG AGGCGCAGTC
ATAGAGCTCA GAGGAAGGTT GCAAGCAATC GTCGACGCCG GCGCCGATGG TCTTAGAGCC
GAGAGGCATG GAAGGCACTG GCGCGTGTAC GTGCCACACG AGAACTGGCA CGTCGTCGTG
AGCAAGCCTA CTAACGGCTG GTCCGTACAC GTACCGCTGG AGGGCTACTG GACTGAGACT
AGTTTCCCGG AGGTTCTCGT GAGGACATCG CCAGACGTGC TTAGAAGCCT GCAGAGGGGG
TGGATCCTGA CGGATGTGAC ACCCCCTCAT GGGCGCCACA GCGATGTACG TTTCAGCACA
ACGCAACCGT GGCAGTTGCC AGCCACGCTC GCGGCCTTTC CTAGTGGCAA TATCCGCTTG
GGCGTCACGG CGGGCATTCT TGGCAGTACC AGGCTAAGCA TCGAGTGGCA TGTACTCGTC
TACGGCTACG AGGAGGAGCT GGGCTGGGCT TCAAGGCTTG TCGGCGAAGT TAAACGTGTA
GAGTATCGCA GGCTGGTCGA GGAGTGCAAG GCGCTTAACG GCGATTCCGT GGCGCTACTG
ACTGCGTATG AAGGAGACGG CATACTCGCG TATTTCCTGA GAATGAGAGA GCTTTACTTC
AGAATTAGAC ATGAGATAGT TTACCTGCCA GCTGAGAGCG CCATTGTCAA TGCCCGCTTA
GCTGTGGAGA GGGCTAGCGA GTACACAAAG TTCGTCTCAT TGGTGACGAA ATGCGCCAAG
ATTAAACACT TCTTGTTCGT CGGCTACGGG ATACCGCAGA AGAGGGGTAG GAAGAACGGG
CAGAAAAACA ACCCGTTCTA CGCCGAGATA GCGGGGGCTA AGCTACACGT AGTCTACTAC
ACGGCTGATA ACCGTATTTA CGCGAGGATC GTGGTTGATG CTGTGCCTCT AGGCTGGGTG
GAGGAGGCGC GTGCTCAGGG CTGGGACGTC CGGGTGGTTC GCATGGGGAG CAAGGAATAC
TACCAGGTGA CTCACAATTC TCTCTTTGAA CACGCGCGTA GCGACGCAAA GCTACGCGCA
ACGCTCCTCG CCTTCACAAG GTACAAGGCC ACACAGTACC CCAAGGCGCA GAGCCTTGTA
AAGCTCTTAG AAGAGCTGGG GACAGAAGAC TAA
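
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before any length or composition check. The sketch below is one way to do that in Python; the helper names are hypothetical, and only the first line of the listing is used as illustrative input. Pasting the full listing into `clean_sequence` would allow a comparison with the Gene Length and GC content fields in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, for illustration only
first_line = "ATGGTTGATA TGATGGGGTA TGAGGAAATC GAGAAGACTG TTAGGGAGGA GACGTGGAGG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(seq[:3])          # ATG
print(gc_fraction(seq))
```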
 
Protein sequence
MVDMMGYEEI EKTVREETWR RAALSLYATR TEDKRRGRTH YRGLFDTVTE VDWDFTRFLV 
NGYTVVPDEA YPRFSKLFDY DARQYLLLND DERPREGGAV IELRGRLQAI VDAGADGLRA
ERHGRHWRVY VPHENWHVVV SKPTNGWSVH VPLEGYWTET SFPEVLVRTS PDVLRSLQRG
WILTDVTPPH GRHSDVRFST TQPWQLPATL AAFPSGNIRL GVTAGILGST RLSIEWHVLV
YGYEEELGWA SRLVGEVKRV EYRRLVEECK ALNGDSVALL TAYEGDGILA YFLRMRELYF
RIRHEIVYLP AESAIVNARL AVERASEYTK FVSLVTKCAK IKHFLFVGYG IPQKRGRKNG
QKNNPFYAEI AGAKLHVVYY TADNRIYARI VVDAVPLGWV EEARAQGWDV RVVRMGSKEY
YQVTHNSLFE HARSDAKLRA TLLAFTRYKA TQYPKAQSLV KLLEELGTED