Gene Tpen_1384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1384 
Symbol 
ID4600667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1337847 
End bp1339703 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content69% 
IMG OID639774159 
Productferredoxin 
Protein accessionYP_920784 
Protein GI119720289 
COG category[R] General function prediction only 
COG ID[COG3894] Uncharacterized metal-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.809303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGTCG TTCGGGTGGA GCCTTACGGG GCAAGGGTGG AGGTTGAGAG CGGGGCGACG 
CTTCTCGAAG CCCTGGCGAG GAGCGGGGTA AGGGTTGCCT CGGTGTGCGG GGGCCGCGGC
TTCTGCGGTA AGTGCAGGGT GCTGGTAACC GGGGGCTCCT CCGCCCTCTC GCCTCCGTCG
AGGTCTGAGT CCATGCTCCT GGGCGGGGAC CTGGGATCCG GCTACAGGCT TGCCTGCCAG
GCCAGGGTGC ACGGGGACGT CGCCGTCTAC GTCCCGGAGG AGAGCAGGGA GTCGCCGGGG
GAGAGGGTGG CTGCGGTGGA CGGCTACGCG AGGCCCGTCA GGGTGGCGCC GCCGCTCAGG
AGGGTCTCGC TCTCGCTGAC TCCCCCGAGT CTCTCCGACT GGAGGTCTGA CGAGGAGCGG
CTAGCCGAGG GGCTTGGGAG GGTTCTCGGG GGGTTCGAGC CGCCGGGGCT CGACGTTCTG
AGGAGTTTGC CGCGGGTTGC CAGGGAGTCC TCCTGGTCCT TGGAGGTCGT CGCGTGGAGG
GGCAGGGTGC TCTCGGTTGA GCCGGGGGGC TCGGGTAGGG GGACGTACGC GGTGGCGGTG
GACCTCGGGA CCTCGAAGCT CGTAGCCCAC CTCGTAGACG CCTCTAGGGG CGTGGTCGTG
GCGAAGGGCT TCGCCGAGAA CCCCCAGCTC GCCTACGGGG AGGACATAAT GACGAGGATG
GACTTCGCGT CGAGGAGCCC TGGCAACCTG GACCTTCTCC GCCGCGTGCT CGTCGAGGGG
GTTAACGGGC TCGTGGCGAG GCTCCGCTCC TCGCTGGGCG TACGGGCGGA CGAGGTCTAC
GCGTTCGCGT TCGCCGGCAA CACGGCTATG CAGCACTTCC TGCTGGGGCT CGACGTGTCC
CAGCTCGCGA GGGCCCCCTA CGTGGCCGTG ACGAGGAGGG CCCTGGAGGT GAGGGCGGCG
GACCTCGGCT TGGAGGCGGG GCCGGGCGCG GTTGCCCTCG TGTTCCCGGT GATAGGGGGC
TTCGTGGGCG GCGACGCGGT GGCGGACGTG CTGGCGACGG GGCTCCACAG GAGGGATTAC
CCAGCCATGC TGGTAGACGT GGGGACGAAC ACCGAGGTTG TGGTTGGCTC CGGCGACCGC
TTCCTCTCCG GCTCCGCGCC TTCGGGCCCC GCTTTCGAGG GCATGCAGAT AACGTTCGGG
ATGAAGGCTG TCTCCGGCGC GATAGACAGG GTGAGGGTGG GGGAGGACGG GGAGGTCGAG
TACACCACGG TGGGGGGCGC GAGGCCCAGG GGGATATGCG GATCCGCTAT GATAGACCTC
GTCGCCGAGC TGTACAGGGC GGGCCTCCTG GACGCCCGGG GGAGGTTCAG GAGGGACGCT
TCGACGAGGA GGCTGAGGAG GGGGGAGAGG GGCATGGAGT TCGTCGTCGC GTGGGCCAAG
GAGACGTCGA TAGGCCGGGA CATAGTCTTC ACCGAGAAGG ACGTCGAGCA GGTGTTGCTG
GCGAAGGCCG CTGTGTCGAG CGCGGCTAGG ACCCTCATGA AGATGAGGGG GTTCAAGGCG
GAGGAGCTGG AGGAGGTCGT GGTGGCGGGC TCCTTCGGGT CGAGCCTCAA CGTCGAGAAC
GCCCTGGAGA TAGGCCTCCT CCCGCCCGTG CCCCCGGAGA AGGTGTGGTT CGCGGGGAAC
ACGGCGGTGG GGGGAGCGGT GCTCGCGCTG GTATCGGAGG AGGCGCTCAG CGAGCTCGAC
GAGATACTCT CGAAGGTGGA GTTCGTGGAG TTCGCGGCTA GCCCAGAGTG GAAGGCGGAG
TTCATGAACT CGCTCTTCAT ACCCTACAGG GAGCCGCCGA GCCGCCTCTC CAGGTAG
 
Protein sequence
MPVVRVEPYG ARVEVESGAT LLEALARSGV RVASVCGGRG FCGKCRVLVT GGSSALSPPS 
RSESMLLGGD LGSGYRLACQ ARVHGDVAVY VPEESRESPG ERVAAVDGYA RPVRVAPPLR
RVSLSLTPPS LSDWRSDEER LAEGLGRVLG GFEPPGLDVL RSLPRVARES SWSLEVVAWR
GRVLSVEPGG SGRGTYAVAV DLGTSKLVAH LVDASRGVVV AKGFAENPQL AYGEDIMTRM
DFASRSPGNL DLLRRVLVEG VNGLVARLRS SLGVRADEVY AFAFAGNTAM QHFLLGLDVS
QLARAPYVAV TRRALEVRAA DLGLEAGPGA VALVFPVIGG FVGGDAVADV LATGLHRRDY
PAMLVDVGTN TEVVVGSGDR FLSGSAPSGP AFEGMQITFG MKAVSGAIDR VRVGEDGEVE
YTTVGGARPR GICGSAMIDL VAELYRAGLL DARGRFRRDA STRRLRRGER GMEFVVAWAK
ETSIGRDIVF TEKDVEQVLL AKAAVSSAAR TLMKMRGFKA EELEEVVVAG SFGSSLNVEN
ALEIGLLPPV PPEKVWFAGN TAVGGAVLAL VSEEALSELD EILSKVEFVE FAASPEWKAE
FMNSLFIPYR EPPSRLSR