Gene Tpen_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0335 
Symbol 
ID4601702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp302907 
End bp304826 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content56% 
IMG OID639773095 
Productanaerobic ribonucleoside triphosphate reductase 
Protein accessionYP_919747 
Protein GI119719252 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1328] Oxygen-sensitive ribonucleoside-triphosphate reductase 
TIGRFAM ID[TIGR02487] anaerobic ribonucleoside-triphosphate reductase
[TIGR02827] anaerobic ribonucleoside-triphosphate reductase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAGA TTGTGCAGCA AAAGAAATCC CTGGTTAGAG ATCTAGAAGA GTCGTACCTC 
GCTGGTATAG ACTGGGAGAG GAATGAAAAT GCTAACCACA ACGTCAGCTA CAGTAACTTT
AGGAATTACT TATTTGAAAA GCTCGTGAAG TCTAAGGAAG TCCTCTCCGA ATACCTTCCG
CCTCTCGCCA TTGAGCAACA CTATAGAGGA GACATCCATA TACATAAACT GCCGGACAGC
CTTTGGATTC CCTACTGCGC CGGGTGGAGC TACAAAAAGA TACTCCAGCT TGGCCTTAAG
ACTCCGACTA TAATCTCTGC CCCTGCCAGG CACTTTGACA CCGCTGTGGC ACACCTGGTG
AACTTCTTCT TCATGGGTGC GCAAGAGTGG ACGGGAGCAC AAGCGGTGAG CGCCTTCGAC
CTATTCGTGG CGCCCTTCGC GGAGGGCGAC GGGCTCTCCA AGGAGAGGAT AAAGCAGACT
CTGCAAGGTA TGCTCTTCGA GCTGAATTAC CCAGCCCGCG CCGGGTATCA AAGCCCGTTC
ACAAACATCA CGCTGGTCAT GGATACCAGC AAGGATATGC TTGAAGGGGA AGCCATAGTG
GCCGGGAAGA AGGCCGGAAG TCTCGGCGAC TACATAGAGA AAGCCCTCGC CGTGGACGAG
GCCCTCTTCG AGCTCTACGC GGAGGGAGAT GCCGCTGGAC AGCCGTTCAC TTTCCCGATA
CCCACCATAA TGCTCACGAA GAACTTCGAC TGGAACGGCA GAAGGTGGGG AGAACTCACA
GACAAGATAT TCACGGCGCT TTCAAGGAAG GGATCCGCGT ACCTGCTGAA CGGTTACGCT
AGCAACGTGG AAGCTCTCTA CGCCATGTGC TGCAGACTAA CGATAGACGT CAACCACGTG
ATGGGCAAGT CGGGAAACGG CTCGCCGTCC TTCAGCCTCA AACTCTCACG TGCAGACGAG
GAGGACGCCT TCGAGAAGTT CATGAAGTCT CGAGAGAACA GCAGGACCTA CGGTATATGG
GCTCTGCCGG ACGCTACTGG GAGTATTGGG GTCGTCACTC TAAACCTGCC GCGCCTCGCG
GCGCTCAGCA AGGGCGAGTG GCCGCTCTTT GAAGAGTTAC TAGACTCCTT GCTGAAGAAC
GCGAGGGACG TGCTACTCAC ATGGAGGAAG AGGTACGAGA AAACACTGGC GGCAGGCCTC
ATGCCTCTAA CAAAGACCTA CCTAGGCCAC TTTAACCACC ACTTTAACAC TATAGGGGTG
ATAGGGCTCC CGGAAGCCGC CGCGAACTTC ATGAGGAACC CCAAGGTGTG GTTCGAGGGC
GGCGTCGGGG AGTTACAAGA GGCGGTGGAG ATAGAGAAGA AGATGGTCTC ACTGATAAGA
GAGAGAGCCT CGGAGCTGGA GGAGAGGGAC GGGTACCTCT ACAACGTCGA GGAGATCCCA
GGTGAGAGTA CAGGCTACAA GCTGGCGCTC GCCGACCTGC AGCTCTTCAA GGAGGAGCAC
GCCAGGGGCG AGGTGCTGAT ACCGAGCGAT GGGAAGGCGC CGTTCTACAG CAACAGCGTG
GTCCCGTACT ACGCCGACGT CCCGATGTAC CAAAGGGCGA TATGGGAGGG GTACGTCCAG
CAGGAGTTCA CCGGCGGCGT CATGATGCAC CTGTTCCTAC ACGAGCAACC CGACCCGAAA
GCGCTACGGA ACCTCGTGTA CAGGATTGCT ACGAACACGA AGGTAGTATA CTTCAGCATA
ACTCCGACGA TAACTGTCTG CAGGAAGTGC GGCAGGAGCA CCACCGGTTA CCTCGACGGC
TGTCCTCACT GCGGCTCCAG CGAGGTGGAA CACTGGAGCA GGATAGTGGG GTACTACAGA
CCAGTTAGCA ACTGGAACCC GGGTAAAAAA GCGGAGTTCA AGCTAAGGGT AACCTACTAG
 
Protein sequence
MTEIVQQKKS LVRDLEESYL AGIDWERNEN ANHNVSYSNF RNYLFEKLVK SKEVLSEYLP 
PLAIEQHYRG DIHIHKLPDS LWIPYCAGWS YKKILQLGLK TPTIISAPAR HFDTAVAHLV
NFFFMGAQEW TGAQAVSAFD LFVAPFAEGD GLSKERIKQT LQGMLFELNY PARAGYQSPF
TNITLVMDTS KDMLEGEAIV AGKKAGSLGD YIEKALAVDE ALFELYAEGD AAGQPFTFPI
PTIMLTKNFD WNGRRWGELT DKIFTALSRK GSAYLLNGYA SNVEALYAMC CRLTIDVNHV
MGKSGNGSPS FSLKLSRADE EDAFEKFMKS RENSRTYGIW ALPDATGSIG VVTLNLPRLA
ALSKGEWPLF EELLDSLLKN ARDVLLTWRK RYEKTLAAGL MPLTKTYLGH FNHHFNTIGV
IGLPEAAANF MRNPKVWFEG GVGELQEAVE IEKKMVSLIR ERASELEERD GYLYNVEEIP
GESTGYKLAL ADLQLFKEEH ARGEVLIPSD GKAPFYSNSV VPYYADVPMY QRAIWEGYVQ
QEFTGGVMMH LFLHEQPDPK ALRNLVYRIA TNTKVVYFSI TPTITVCRKC GRSTTGYLDG
CPHCGSSEVE HWSRIVGYYR PVSNWNPGKK AEFKLRVTY