Gene Tpen_0506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0506 
Symbol 
ID4601002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp460738 
End bp462156 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content57% 
IMG OID639773274 
Producthypothetical protein 
Protein accessionYP_919916 
Protein GI119719421 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGGAG AGAAGATGAT GGGGTATGAG GGGGTCGAGG GAGAGGTTAG GGAGGAGACG 
TGGAGGAGGG CGGCTTTAAG CCTCTACTCG ACCGAGGTCG AGGATAAGAG GAAGGGCAGG
GTACACTACC GCGGCCTCTA CGATACGGTG TCTGGGATTA ACTGGGACTT TACGCGCTTC
CTGGTGAACG GCTACAGCGT AGTGCCGGAC GAAGTCTACT CCAGGTTCTA CAGGTTCATC
GACTATGACT TGAGGAAGTA TCTCTTGCTG AACGACGACG AGAAGCCGAG AGAGGGAAGC
GCGGTGACGG AGCTCAGAGG AAGGCTACAG GCAATAGTCG ACGCCGGCGC CGACGGTCTT
AGAGCTGAGA AGAAGGGTAA AGTCTGGCAT GTATACATAC CTGGAGAGAA CTGGCACGTC
ATCGCACACA AGCCTACATA TGACTGGATC GTACACGTCC CATTGGAGGG CTTCTGGACC
GAGGCTAGTT TCCCGGAGGT TCTCGCGAGG ACATCGCCAG ACGTGCTTAG AAGCCTGCAG
AAAGGGTGGG TCTTAACGGA TGTGACACCC CCTCATGGGC GTATCAGCGA CGTGCGCTTC
GCCACGACGC AACCGTGGCA ACTCCCCAGT ACGCTCGCGG CTTTTCCAAG CGACCACGTT
GACGTAGGTG TCACGGCAGG AGTCCTCGGC AGTACCAGGA TGAGCATTGA GTGGCGCGCA
AACGTCTACG GCTACGCGGA TGAGCTAGGC TGGGCTTCAA GGCTTATCGG CGAGGTTAAA
CGTGTAGAGT TCCGCAGGTT GGTTGAGGAG TGCAGGGCGC TCAACGGCGA CTCGGTGGCA
CTGGCGACCG CTTTCCTAGG GGACGGCGAG CTCGAATACT TTCTAAGGCT TAGATGGCTC
TACTTCAGGG TTGGACAGGA GCACATCTAC TTACCGGCTG AGAGCGCCAT AATCAACGCT
AGGCTTGCCG TCGAGCTAGC ACCAGAGTAC ACAAAGTTCG TATCACTGGT AACGAGATGC
GCTAAGATCA AACACTTCCT CTTCGTCGGC TTCGGAGCAC CGCAGAGGAG GGGCAGGAAA
AACGGGCAAA GCCCTAGCCC ATTCTACGCC GAGGTAGCGG GAGCTAGACT AGTCCTAGTC
TACATCTCTA CGAAGAACAA TATCTACGCC CGTATAGCTG TGGACGGCGT GCCTCCAGGC
TGGTACGGGC GCGCGCTGGA GGAAGGCTGG GACGTCCGGG TGGTTCGAAT GGGGAGCAAG
GAATACTACC AGGTTACACA TTGCTCCCTG TTCGAGCACG CCCGCTACGA CGCGGCTCTG
CGAGAAACAC TCCTCGCCTT CGCGAAGGCG AAAGCCGAGC AGTACCCCAA GGCGCAAAGC
CTCGTAGAAC GCCTCGAAAA GCTGGGGACA GGAGACTAA
 
Protein sequence
MVGEKMMGYE GVEGEVREET WRRAALSLYS TEVEDKRKGR VHYRGLYDTV SGINWDFTRF 
LVNGYSVVPD EVYSRFYRFI DYDLRKYLLL NDDEKPREGS AVTELRGRLQ AIVDAGADGL
RAEKKGKVWH VYIPGENWHV IAHKPTYDWI VHVPLEGFWT EASFPEVLAR TSPDVLRSLQ
KGWVLTDVTP PHGRISDVRF ATTQPWQLPS TLAAFPSDHV DVGVTAGVLG STRMSIEWRA
NVYGYADELG WASRLIGEVK RVEFRRLVEE CRALNGDSVA LATAFLGDGE LEYFLRLRWL
YFRVGQEHIY LPAESAIINA RLAVELAPEY TKFVSLVTRC AKIKHFLFVG FGAPQRRGRK
NGQSPSPFYA EVAGARLVLV YISTKNNIYA RIAVDGVPPG WYGRALEEGW DVRVVRMGSK
EYYQVTHCSL FEHARYDAAL RETLLAFAKA KAEQYPKAQS LVERLEKLGT GD