Gene Tpen_0475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0475 
Symbol 
ID4602034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp431007 
End bp432710 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content56% 
IMG OID639773243 
Productradical SAM domain-containing protein 
Protein accessionYP_919887 
Protein GI119719392 
COG category[C] Energy production and conversion 
COG ID[COG1031] Uncharacterized Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACGG TGGTGCTCCT CGACGGCTAC AACGACGAGC CTGCAGGGCT CGGAGTGCCG 
CCCTACATCG ACGTATACGC TAGGTACGTG GCCGGAGCTG TTTGGACCGT AGAGCACTCC
GCCACCGTCC ATTACTTTAC GATAGACTTT GTGCGCGAAA ACCCGGACAC TTTCTTCAGA
GTTGCTGGCA AGAGTGACCT TTTAGTCGTT TTTGGGGGAG TAGTAGTTCC CGGAAAGTAC
CTGGGCGGCA AACCGATCAC AGCGGAAGAA CTCGTCAGGA TTCCCAGATC GGTCGAAGGA
CCGGTAAAGA TACTTACAGG ACCGTTCGTG CGCTTCGGGC TCGGCGTGCG TGGCGGGGAA
AGAGCTGTGC CACGCGAAGA GTTCGAGGAC GCGTACGACC TAGTTGCTCC CGGAGACCCC
TGGCTAGTCG TATACGAGTA CATGCTCGAG AAATCCTTGG AAAAAGTGAA CCCCTACGCC
GTGTCGAAAG ACTATAGCCT CGTCGACAGA TTCGCCGTTC GCGGCGCCCG CATCGTCCTA
CAACACCCCA ACCTCGGCTA CAACCTTACA GCAGAGATTG AGACGTACAA GTCGTGCCCG
AGGTGGGTCA CCGGGGGCTG TAGCTTCTGC ATAGAGCCAC GGCTAGGCAG AGTAGTATTC
CGGGAGGCCA AGTCCATAGG AGAGGAGGTG AGGGCTCTCT ACGAGCTCGG CGTGAGGGCG
TTTAGGCTTG GACGCCAAGC AGACTTCCTA GCCTACAAGG CCAAGGGGGT AAACGAGGTA
GAGTTTCCAG AGCCCGACCC CTCAAGCATA GAGGAGCTCA TGCGGACTGT CCGCCTCGCA
GCTCCGAGTG CAGAGACCAT ACACATCGAC AACGTAAACC CCGCCACGAT ATACATCCAC
AAGGAGAAGG CGCGCGAAGC CCTGAAGTCC ATTGTCAAGT ACCACACGCC GGGCGACGTA
GCGGCGTTCG GACTCGAAAG CGCCGACCCG GTAGTCGTGA AGCAGAACAA CCTTGGAAAC
GACGCCGAAG GAGTACTAGA GGCGATTAAA ATAGTCAACG AGGTAGGAGC CAGGAGAGGG
TGGAACGGAT CCCCCGAGCT ACTTCCCGGG GTGAACTTCG TCCTCGGGCT CCCCGGAGAA
ACCGCTAAGA CGTACAGAGC CAACAAGGAG TTTCTAGAGA GGATCCTGGA GGAGGGGTTG
CTCGTGAGAA GGGTTAACGT GCGCGAAGTC CTCCCGCTAC CCGGAACACC CATGTGGAGC
GTTGGCACGA GCATTGCAGA GGCGCATAGA AAGTACGCGA AAGCGTTCAA GAAGTGGGTC
AGGAGAAACT TCGACGAGAA AATGCTTCCA AAGGTGTACC CTAAAGGAAC AATATTGAGA
AATTGCTACG TAGAGTACCA AGCCGGTCCA ACGACCATAG CGAGGCCCAC GGGCAGCTAC
CCGATAGCCG TCTTTCTAGA GGAAAAAGAA GGCTTGGCGA AAATGATGAA GGTAGACTGC
ATAGTCACCG GACACAAAGC CAGGTCACTG CTCGGAGTCC CCCTACCGGT GAACCCTCAG
AAACACTCCT TGAAGACATT AACGAAAGTA TTCGGAAGGA AAAGAGCCTT CGAAGTCAAA
CGGGGACTTG TACCCAAGGA TCCCCTTGCG AGGTACATCA CTGGAAGAAT ATCCTCTGTA
GCTGCATCAT CTCATGGCCT GTAG
 
Protein sequence
MTTVVLLDGY NDEPAGLGVP PYIDVYARYV AGAVWTVEHS ATVHYFTIDF VRENPDTFFR 
VAGKSDLLVV FGGVVVPGKY LGGKPITAEE LVRIPRSVEG PVKILTGPFV RFGLGVRGGE
RAVPREEFED AYDLVAPGDP WLVVYEYMLE KSLEKVNPYA VSKDYSLVDR FAVRGARIVL
QHPNLGYNLT AEIETYKSCP RWVTGGCSFC IEPRLGRVVF REAKSIGEEV RALYELGVRA
FRLGRQADFL AYKAKGVNEV EFPEPDPSSI EELMRTVRLA APSAETIHID NVNPATIYIH
KEKAREALKS IVKYHTPGDV AAFGLESADP VVVKQNNLGN DAEGVLEAIK IVNEVGARRG
WNGSPELLPG VNFVLGLPGE TAKTYRANKE FLERILEEGL LVRRVNVREV LPLPGTPMWS
VGTSIAEAHR KYAKAFKKWV RRNFDEKMLP KVYPKGTILR NCYVEYQAGP TTIARPTGSY
PIAVFLEEKE GLAKMMKVDC IVTGHKARSL LGVPLPVNPQ KHSLKTLTKV FGRKRAFEVK
RGLVPKDPLA RYITGRISSV AASSHGL