Gene Tpen_0153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0153 
Symbol 
ID4600645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp130229 
End bp131290 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID639772907 
Productphosphoadenosine phosphosulfate reductase 
Protein accessionYP_919566 
Protein GI119719071 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.327935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGT ACCCCTTTCT TTCTCCCAGC GGCAGGAGGG GGTACGTCAG GTCGTTCTCC 
GACTACATAG AAGTGGAGTA CGAGGGCGAA CCCTCCTCCC TCTACCTGTG GTCGCAGGGT
AGCGGCTCTC TCTACAGGGG TAGCACGGGC TTCTTTAAAA GCGTCTACGA GCCTCCCCTA
GCCGAGAAGG GGTGGAGCAC CGAAGCCCCC GTTGAAGACC GGGTGGCCGA GGTCGCGCGG
AGGCATGGCG AGAAGCTTAA GGGTAAGCAC GTAATGGCTG ACCTCAGCGG CGGGAAGGAC
AGCACGGCTA ACCTCTACCT ACTCACAAAG CTCCAGGAAA TCGTGGGCTT CAAGGTAACG
GCAGTCTACG TGCACATGCC GTACCTCGAG CCCGTGGAGA ACATAGCCTT CGCGGAGAAA
GTAGCGTCAA GACTGGGAGT CGACCTGAGA ATAGTCGAGC CGGACAGGAG GAAGCTGGAG
TTCTACCTGC TGAGGGAGGG GCTCCCGAAG CGCGGCGACA GGTGGTGCAC GTACCTGAAG
ACTCGCGCCC TAAGAGAGGC GAAGAAGGAG ATAGGGGCCG AGGTTGAGGC TAAAGCCGAG
AGGGCGCTCG AGGCCGGGAA GCGCTACGAA AGGCTCAGCG GCCTCGCTAA GAGAAAAGTG
TACTTCAACG GGGGAGTGGT AAACCTCGTC CATGACCTCT CCGCGGCGGA AGTCGCGGGA
ATAGTTAGGC GCGCCGGCCT CGTACACCCC CACTACCTTC AAGGGTTACC CCGCGTGAGC
TGTAGGTTCT GCCCGTACAG AGGGCTCTAC GAGCTCGAGG TCTCCTCGAA GCACGAAGTC
GAGGACGAGG GTCTCGTGGA GTGGGTCATG GCGAGGACCT ATAGGAACTA CTACTCGAGC
GTTACGCCGC TAGAAACGTT CCTAGAGCTA CACTTGTGGC GCTACACCCC CTCCGTGGCT
AGGCTCCGCG TGCTGGAAGC GGGCTACGTC GACCCAGACT CGAAAATCTC GCTGAGCGAA
GCAAGGAAAA TGTTCTCGTG GATATGGGTG GGCAAGGCGT GA
 
Protein sequence
MEKYPFLSPS GRRGYVRSFS DYIEVEYEGE PSSLYLWSQG SGSLYRGSTG FFKSVYEPPL 
AEKGWSTEAP VEDRVAEVAR RHGEKLKGKH VMADLSGGKD STANLYLLTK LQEIVGFKVT
AVYVHMPYLE PVENIAFAEK VASRLGVDLR IVEPDRRKLE FYLLREGLPK RGDRWCTYLK
TRALREAKKE IGAEVEAKAE RALEAGKRYE RLSGLAKRKV YFNGGVVNLV HDLSAAEVAG
IVRRAGLVHP HYLQGLPRVS CRFCPYRGLY ELEVSSKHEV EDEGLVEWVM ARTYRNYYSS
VTPLETFLEL HLWRYTPSVA RLRVLEAGYV DPDSKISLSE ARKMFSWIWV GKA