Gene Tpen_0665 details


Gene Information

Locus tag: Tpen_0665
Symbol:
ID: 4601623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 614229
End bp: 616073
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 54%
IMG OID: 639773438
Product: hypothetical protein
Protein accession: YP_920070
Protein GI: 119719575
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes
TIGRFAM ID:
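
The length fields above are consistent with the listed coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon (the function name `cds_lengths` is hypothetical, not part of any database API):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record for Tpen_0665.
gene_len, protein_len = cds_lengths(614229, 616073)
print(gene_len, protein_len)  # 1845 614
```

The result matches the Gene Length (1845 bp) and Protein Length (614 aa) fields of this record.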


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.545372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAGTT TCGAGGTGCT TAAACGCTCC CAGGTTCATG TATGGTGGGA TTTAAAAGAG 
AACCTCCCAC GGCTTACCGC TTCGAATAGT GAGCCGTGCT TCAGAGTCCG CCTCTCTCCT
CCAGGCGACG CAAGGCCCCT CGTCGGCCAC TATTACAAGT TTGTCTTCAG GGTCATCGAG
ACATACTTCG GTGGAGACCC TCCCTTTCCC AGGCGTAAGA TAGCCCTGGT GAACAAGGTC
CCGTATCCCG ACCTCGCGGA AGAAGTAGTT ATCGACGGTC AGGTAGCCGG GCATCTACTC
TACGACCTGA GGTCGGAGCG ATGGCGGTTC AAGCCGCTGT ATGCTGGCGC TGAGGAAGCC
CTGAGGAATA GGAAAGGGTA CTACGCTATT GTAGATTTGC CAAGGTTGTC GAGAGGCTAC
GTTGTGAAGA AGGACTCGGT AGTAGAGGCT AACCTTCCTG AGGGGGACGA GTACGTAACC
ATAGGCACGA AGAGCGGGGA CTTCTACGGC GTAGGAGCTA TGCTCAGGAA CAGAAGGATC
TACGTGCTGA AGGCCTGGAG GGCGAGGCCC AGGGCGTGGC TCTCTAGAGA CCCTGACTGG
AGTACCGCGG TGCTACGCAA CAGGGAATAC CTGCTCGCAA AGGAGGCTGA AGCCGTAAGC
TTCATCAGAG AGGTTGCCGA GAAGTACCGG CTACCCGTCT TCGTATCTCT CTCGGGAGGC
AAAGACAGCC TCGTAACTCT CCACCTCGCA GTCAAGGCCC TTGGAAACGA GAAGGTGAAA
GCCCTCTTCA ATAACACGGG CCTCGAGTTC GAGGAGACGG TAGAGTACGC TAGGAGAATC
GCGGACTACT ACGGCGTTGA ACTCATAGAG GCGGATGCCG GCGACAACTT TTGGAGAGCA
CTCCCAGTCA TGGGGCCCCC TGCGAGAGAC TACAGGTGGT GCTGCAAGGT GACTAAGTTC
TCGACGATCT CTAGGGCCGT GAAGAAATTC TTCCCAGAGG GAGCCCTGAG CCTCGTAGGG
CAGAGAAAGT ACGAGTCGAG CGCTAGGGCC CTCTCTCCAA GAATCTGGCG GAACTACTGG
CTTCCAGGCG TGGTGGCGGC AAGCCCGGTG CACGATTGGA GCGCGATGGA TATTTGGCTG
TACATCTTCA TGGAGAGGTT GCCGGTGAAC AAACTCTACT ACTACGGATT CGACAGGCTC
GGCTGCTGGC TCTGCCCGGC AAGCGAGATG GGAGAACTCG ACCTTTTAAG GATTGTTAAG
CCTGGTCTCT ATGATAAGTG GAAGTCCTAC CTGGAGAGCT ACGCCCAGAT GAACGGTCTA
GGCGAGGAAT GGGTTAAATT TGGACTCTGG AGGTGGGTTA GACCCCCGAA GGATATACAG
AGGATCTGTG TCTCCCAGGT CCAGGCGAGG AGAGGAGCGC GCTACGACAG ACACGCATCA
GGCAATACCG TAGAGTACAG GCTTCATAAC CCCTCGGTGC GCATTATCGA GGAGAAAGTT
CGAAACCTTT GGCACACCCT TAACAAACCG CTCGAAATAG AAAGCGTACA GGTAACAGGA
AATGCCGTAG CGCTGAGATT TAAGCGTGAA GCTACGGCTA GCGAGCTAGA ACTTGTCGAC
AGGGTTTTGA TAAGGTCGTA TTTCTGCGTA GAATGTTTAG AGTGCTCCAA CTGGTGCCCC
ACGAAGTCTA TAAGCATAGA TTCAGAGAAC GGAGGGATAA AGGTCAACGA ATCCACGTGC
ATCCACTGCG GGACTTGTAA CTACAAGTGC CCCGTAGTCG AGTACACGTT TAAGCATCTC
GAAATGCTTA AACCGCAACC ACAAACACGC ACTACGGCTT CTTGA
 
Protein sequence
MSSFEVLKRS QVHVWWDLKE NLPRLTASNS EPCFRVRLSP PGDARPLVGH YYKFVFRVIE 
TYFGGDPPFP RRKIALVNKV PYPDLAEEVV IDGQVAGHLL YDLRSERWRF KPLYAGAEEA
LRNRKGYYAI VDLPRLSRGY VVKKDSVVEA NLPEGDEYVT IGTKSGDFYG VGAMLRNRRI
YVLKAWRARP RAWLSRDPDW STAVLRNREY LLAKEAEAVS FIREVAEKYR LPVFVSLSGG
KDSLVTLHLA VKALGNEKVK ALFNNTGLEF EETVEYARRI ADYYGVELIE ADAGDNFWRA
LPVMGPPARD YRWCCKVTKF STISRAVKKF FPEGALSLVG QRKYESSARA LSPRIWRNYW
LPGVVAASPV HDWSAMDIWL YIFMERLPVN KLYYYGFDRL GCWLCPASEM GELDLLRIVK
PGLYDKWKSY LESYAQMNGL GEEWVKFGLW RWVRPPKDIQ RICVSQVQAR RGARYDRHAS
GNTVEYRLHN PSVRIIEEKV RNLWHTLNKP LEIESVQVTG NAVALRFKRE ATASELELVD
RVLIRSYFCV ECLECSNWCP TKSISIDSEN GGIKVNESTC IHCGTCNYKC PVVEYTFKHL
EMLKPQPQTR TTAS
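
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be checked against each other, assuming plain Python with no external libraries: it strips the whitespace, computes the GC fraction, and translates codons up to the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; alternative start codons are not handled here.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G.
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGTCGAGTT TCGAGGTGCT TAAACGCTCC CAGGTTCATG TATGGTGGGA TTTAAAAGAG"
print(translate(first_line))  # MSSFEVLKRSQVHVWWDLKE
```

The translated fragment agrees with the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.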