Gene Tpen_0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0059 
Symbol 
ID4602144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp44153 
End bp45499 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content61% 
IMG OID639772813 
Productamidohydrolase 
Protein accessionYP_919472 
Protein GI119718977 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.312881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTACGC TGGTTATCCG CGACGCTGAC ACAATCTTGA CGATGAACAG CGCGAGGCCC 
GTGCTTAGGC ACCAGTCGAT CCTCGTTGAC GGCAACGTTG TGGCGGCCGT GGGGGATTAT
AGCTCGCTTG TCGCGAGCTA CGGGGCTCCC GACGAGGTTA TCGATGGTCG TGGAAAGGTC
GTTTTGCCGG GTTTCTACGA CCTGCATACT CACATCGCGA TGGCGGGTTT CAGGGGGTTG
GCGGCGGACG CCGGCGACGT TATCTACAGG GTTTTCTGGC CGCTGGAGCG GTTGCTGAGC
GGGGAGTCGG TTTACCGCTT CGCCCTGCTA GGCGGGTTGG AGGCTTTGAA GAGTGGCGTG
GTGCTTGTGG CTGACCACTA CTTCTTCATG CGGGACGTCG CGAGGGCGCT GGTAGAGCTG
GGCTTGAGGG GGCTTCTGGG GCATACGTAC ATGGATAGGG ATGGCCCCTT CACCGGGGAG
CGCGAGCTTA GGGAGGCTTT GGACTTCGTC GAGCGGTGGA GGGGGCACGA GCTGATCACA
CCCGTGCTGG CGCCCCACGC TCCCGACACC GTCAGCAGGG ATAACCTGCT CTACTTCGCG
GAGCTCTCCA GGGAGAAGAA CCTCTTCGTA CACATGCACC TCGCGCAGAC TCTACGTGAG
TTTAAAACAG TGAAGGAGGA GACCGGCTAT ACCCCTGTGA GGTATGCTTT GAGGCTGGGG
CTCCTAGGAG GGAGGAGCAT AGTGGCTCAC GCGAACTACG TGGACGAGAA CGAGAAAGCC
CTTCTGGCGC ACAGTGGTAG CGTGATCGTC CAGTGCCCCT CCACGTACTT CATGTCCGGT
ACCCCGTTCC ACGCCTATGA CTACTGGCAG CTTGGCGGCA ACGTCGCGAT AGGCACGGAT
GCCCCGTGCT ACAACGACAA CGTGGATTTC TTCGAAGAGA TGAGGCTCCT GGTCTACGGC
CAGCGCATGA AGCTCGAGAA GAGCGGTGTT TGGAGGGCCT ACGACGTCCT GGAGATGGCT
ACAAGGCTCA GCGCGAGGCT TGTGGGGGTC CGCGGCGGCT TCGTCGGTAA GGGGGCCTTG
GCAGACCTCG TGCTCGTGAA CCTTTCCAGC GTGAGGCTTA GACCCTTCCT GGACCCCTTC
TCGAACATAG TCTACGCCGC TAGTAGCGGT GACGTGGACA CGGTGATTGT TAACGGCAGG
GTAGTACTGA AGGGTGGGAG GCACGTCTCG CTCGACGAGG AGAGGATTGT CAGCCAGGCC
GAGGCCGAGG CGAGGATGCT CTTGAGGAGG GCGCTCGACG AGAGCCCCGA GCTCGAGAGC
ATAATAGGGA GGGATAAAGA AATATAG
 
Protein sequence
MATLVIRDAD TILTMNSARP VLRHQSILVD GNVVAAVGDY SSLVASYGAP DEVIDGRGKV 
VLPGFYDLHT HIAMAGFRGL AADAGDVIYR VFWPLERLLS GESVYRFALL GGLEALKSGV
VLVADHYFFM RDVARALVEL GLRGLLGHTY MDRDGPFTGE RELREALDFV ERWRGHELIT
PVLAPHAPDT VSRDNLLYFA ELSREKNLFV HMHLAQTLRE FKTVKEETGY TPVRYALRLG
LLGGRSIVAH ANYVDENEKA LLAHSGSVIV QCPSTYFMSG TPFHAYDYWQ LGGNVAIGTD
APCYNDNVDF FEEMRLLVYG QRMKLEKSGV WRAYDVLEMA TRLSARLVGV RGGFVGKGAL
ADLVLVNLSS VRLRPFLDPF SNIVYAASSG DVDTVIVNGR VVLKGGRHVS LDEERIVSQA
EAEARMLLRR ALDESPELES IIGRDKEI