Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0059 |
Symbol | |
ID | 4602144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 44153 |
End bp | 45499 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639772813 |
Product | amidohydrolase |
Protein accession | YP_919472 |
Protein GI | 119718977 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.312881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTACGC TGGTTATCCG CGACGCTGAC ACAATCTTGA CGATGAACAG CGCGAGGCCC GTGCTTAGGC ACCAGTCGAT CCTCGTTGAC GGCAACGTTG TGGCGGCCGT GGGGGATTAT AGCTCGCTTG TCGCGAGCTA CGGGGCTCCC GACGAGGTTA TCGATGGTCG TGGAAAGGTC GTTTTGCCGG GTTTCTACGA CCTGCATACT CACATCGCGA TGGCGGGTTT CAGGGGGTTG GCGGCGGACG CCGGCGACGT TATCTACAGG GTTTTCTGGC CGCTGGAGCG GTTGCTGAGC GGGGAGTCGG TTTACCGCTT CGCCCTGCTA GGCGGGTTGG AGGCTTTGAA GAGTGGCGTG GTGCTTGTGG CTGACCACTA CTTCTTCATG CGGGACGTCG CGAGGGCGCT GGTAGAGCTG GGCTTGAGGG GGCTTCTGGG GCATACGTAC ATGGATAGGG ATGGCCCCTT CACCGGGGAG CGCGAGCTTA GGGAGGCTTT GGACTTCGTC GAGCGGTGGA GGGGGCACGA GCTGATCACA CCCGTGCTGG CGCCCCACGC TCCCGACACC GTCAGCAGGG ATAACCTGCT CTACTTCGCG GAGCTCTCCA GGGAGAAGAA CCTCTTCGTA CACATGCACC TCGCGCAGAC TCTACGTGAG TTTAAAACAG TGAAGGAGGA GACCGGCTAT ACCCCTGTGA GGTATGCTTT GAGGCTGGGG CTCCTAGGAG GGAGGAGCAT AGTGGCTCAC GCGAACTACG TGGACGAGAA CGAGAAAGCC CTTCTGGCGC ACAGTGGTAG CGTGATCGTC CAGTGCCCCT CCACGTACTT CATGTCCGGT ACCCCGTTCC ACGCCTATGA CTACTGGCAG CTTGGCGGCA ACGTCGCGAT AGGCACGGAT GCCCCGTGCT ACAACGACAA CGTGGATTTC TTCGAAGAGA TGAGGCTCCT GGTCTACGGC CAGCGCATGA AGCTCGAGAA GAGCGGTGTT TGGAGGGCCT ACGACGTCCT GGAGATGGCT ACAAGGCTCA GCGCGAGGCT TGTGGGGGTC CGCGGCGGCT TCGTCGGTAA GGGGGCCTTG GCAGACCTCG TGCTCGTGAA CCTTTCCAGC GTGAGGCTTA GACCCTTCCT GGACCCCTTC TCGAACATAG TCTACGCCGC TAGTAGCGGT GACGTGGACA CGGTGATTGT TAACGGCAGG GTAGTACTGA AGGGTGGGAG GCACGTCTCG CTCGACGAGG AGAGGATTGT CAGCCAGGCC GAGGCCGAGG CGAGGATGCT CTTGAGGAGG GCGCTCGACG AGAGCCCCGA GCTCGAGAGC ATAATAGGGA GGGATAAAGA AATATAG
|
Protein sequence | MATLVIRDAD TILTMNSARP VLRHQSILVD GNVVAAVGDY SSLVASYGAP DEVIDGRGKV VLPGFYDLHT HIAMAGFRGL AADAGDVIYR VFWPLERLLS GESVYRFALL GGLEALKSGV VLVADHYFFM RDVARALVEL GLRGLLGHTY MDRDGPFTGE RELREALDFV ERWRGHELIT PVLAPHAPDT VSRDNLLYFA ELSREKNLFV HMHLAQTLRE FKTVKEETGY TPVRYALRLG LLGGRSIVAH ANYVDENEKA LLAHSGSVIV QCPSTYFMSG TPFHAYDYWQ LGGNVAIGTD APCYNDNVDF FEEMRLLVYG QRMKLEKSGV WRAYDVLEMA TRLSARLVGV RGGFVGKGAL ADLVLVNLSS VRLRPFLDPF SNIVYAASSG DVDTVIVNGR VVLKGGRHVS LDEERIVSQA EAEARMLLRR ALDESPELES IIGRDKEI
|
| |