Gene Tpen_1531 details

Gene Information

Locus tag: Tpen_1531
Symbol:
ID: 4600373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1477566
End bp: 1478786
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 64%
IMG OID: 639774305
Product: SufBD protein
Protein accession: YP_920930
Protein GI: 119720435
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component
TIGRFAM ID:
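
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon excluded from the protein length; the function name cds_lengths is hypothetical and not part of the record.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Assumption: the final codon is a stop codon and is not counted.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(1477566, 1478786))  # (1221, 406)
```

The result matches the reported 1221 bp and 406 aa.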


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGCTGA TGGAGCGCGT TGAAGCGGCT AGGAAGGCGC TGTCGAAGCC CTCTCCCTAT 
GGCCCGGACC TCGATTTATC GTCCTTCAGG GTCGCGGCGC CGGGGTACTC CGCCGGGGAG
CCGCCCAGGG CTGTCTCCTC CTCCGTGAAG GAGAGAGTCG GCATAGACGT TTCCCGGGCG
AGCTACGTCC AGGTGGGCGA GACTATCTTT GCGAGGGTTC TAGCCGAGAG GCTATTCCGG
GAATACGGGG TCGTGGTGAA GCCCCTCTCG AAGGCTTTGA AGGAGGACGG GCCCGCCGCG
AGGGTAGCTT GGTCCCTGAT AGACCCGGCG ACGGACAAGT ACACCGCCTA CGCCTACCTC
TACGGCGGGG AGGCCGGCTA CTACGTCTAC GTACCGCCAG GCGTCAAGGT CGACGTGCCG
ATATACACCT GCCTGAGCCT CTTCACAGGG GAGGAGGTCC AGTTCGCTCA CAACGTCGTC
TACCTCGACG AGGGGGCGGA GGCTGTCGTC ACGACGGGTT GCCTCGTGCC GCACGGCGTC
AAGGGAGGGC TCCACGTGGG GCTTTCGGAG TTCTTCGTGG GTCCCGGCGC GAGGCTCGTC
TTCACCATGC TCCACGCGTG GGGCGAGGGT ACGCACGTAC GCCCCAGGAC TGCGGTGAGG
GTGGAGGAGG GCGGAGAATA CGTGAGCTAC TACGCCGTGT ACAGCCCGGT AGCGTCCCTG
CAGACGTACC CCAGGGTTTA CCTGTATAGG CGGGCGAGCG CCAAGCTAAC CTCCGTAGTC
GCCGGCACGG GGCCCGGCGT CTACGACGTG GGTGGAGGGG CCGTGCTTCT GGGAGAGGGG
AGCAGCGCGG AGCTCGTCTC CAGGGTGCTC GGGAGCAAGG GGTCTACGAT AGTCTCCAGG
GGGTCTGTAG AGGCGGAGGC CAACGACGTG AAGGGGCACA TAGAGTGCCT TGGGGTCATG
CTCGACCCCT CTTCGACCAT AGAAGCTGTC CCGGTGCTGA AGTCCAGGGT GGAGAGGGCT
GAGCTGACGC ACGAGGCGGC TATAGGCATG CTCTCCGGGG AGAAGATAGA GTACCTTATG
GCTAGGGGGT TCAGCGAGGA GGAGGCGAGG TCTATACTCC TGAGGGGCTA CCTCACAGCG
GAGACCCCGG GCCTACCTGA GAGCGTTAAA GGCGAGATCG AAAGGGTAAT AGAGTACGTC
GTGAGGCACG CCCTCGGGTA G
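
The GC content reported for this gene (64%) can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming GC content means the share of G and C among all A/C/G/T letters; the function name gc_percent is hypothetical. Only the first line of the sequence is used here for brevity, so its value will differ from the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T letters of seq."""
    # Keep only nucleotide letters; this drops the spaces and line breaks
    # that group the bases into blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide letters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


first_line = "GTGAGGCTGA TGGAGCGCGT TGAAGCGGCT AGGAAGGCGC TGTCGAAGCC CTCTCCCTAT"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence as one string gives the figure to compare with the GC content field.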
 
Protein sequence
MRLMERVEAA RKALSKPSPY GPDLDLSSFR VAAPGYSAGE PPRAVSSSVK ERVGIDVSRA 
SYVQVGETIF ARVLAERLFR EYGVVVKPLS KALKEDGPAA RVAWSLIDPA TDKYTAYAYL
YGGEAGYYVY VPPGVKVDVP IYTCLSLFTG EEVQFAHNVV YLDEGAEAVV TTGCLVPHGV
KGGLHVGLSE FFVGPGARLV FTMLHAWGEG THVRPRTAVR VEEGGEYVSY YAVYSPVASL
QTYPRVYLYR RASAKLTSVV AGTGPGVYDV GGGAVLLGEG SSAELVSRVL GSKGSTIVSR
GSVEAEANDV KGHIECLGVM LDPSSTIEAV PVLKSRVERA ELTHEAAIGM LSGEKIEYLM
ARGFSEEEAR SILLRGYLTA ETPGLPESVK GEIERVIEYV VRHALG
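
The gene starts with GTG while the protein starts with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 60 bases of the gene. The names CODON_TABLE, START_CODONS and translate_cds are hypothetical, and the set of start codons is an assumed subset of those allowed by table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    dna = "".join(seq.split()).upper()  # remove spaces and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAGGCTGA TGGAGCGCGT TGAAGCGGCT AGGAAGGCGC TGTCGAAGCC CTCTCCCTAT"
print(translate_cds(first_line))  # MRLMERVEAARKALSKPSPY
```

The output agrees with the first 20 residues of the protein sequence above. An internal GTG is still translated as V; only the first codon is treated as an initiator.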