Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1199 |
Symbol | |
ID | 4810152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1429136 |
End bp | 1430431 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106622 |
Product | amidohydrolase |
Protein accession | YP_001037624 |
Protein GI | 125973714 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.242168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACATAC TGATAAAGAA TGCCGACATA ATTACCTGCA ATGCGTCAGA TGACGTGTTG CAGGGTGCGT TTTTGGGCAT AAAGGATGGA TATATTGATT TTATAGATAC AAAAGAAGAT GCTTTAAAAG ACTTTAAGGC CGACCGGATT ATTGACGCAA AGGGAAAACT GGTTATGCCG GGTTTGGTGA ATGCCCACAC CCACAGCGGG ATGACAATAC TCAGGAACTT TGCAAACGAC CTTGCATTGG AAGACTGGCT TTTCGGCAAT GTACTTCCCG TGGAAGAGAA ACTTACACCG GAAGACATAT ACTGGGGTAC ATTGCTGGGA ATAGCCGAGA TGATAAAATC AGGCACTACG ACTTTTGCCG ACATGTATCT TCATATGGAA GAAGTGGCAA GGGCTGTTTC GGAAACGGGC ATAAGGGCAA ATCTTTGCAG AAGTCCGCTT AAAGACAGCG ATAAAAGTGT GGAAGATGCC GTTCGGTGTT TTGAATATTT TAAGAAGTGG GACAACAGCT TTAACGGTAG AATAAAAGTG TACATTGAAG TTCACTCGGT TTATCTTTTT GACGAACCGT CGCTGCGTAT GTCGGCGGAA GTTGCAAAAG AGATCAACAC AGGAATTCAC ATACATGTGC AGGAGACTTT GAAAGAGTGT GAGGACAGCA ACAAAAAGTA TGGTATGAGT CCTGCGGAAA TTTGCTGTAA GACCGGCATT TTTGACGTTC CGGTAATCGC TGCCCACTGT GTGCATTTGT CCGACGGGGA TATGGGTATA ATCAGGGATA AGGGCGTAAA TGTGATCCAC AACCCCACCA GCAATTTAAA GCTGGGCAGC GGAATAGCCA AAGTGGATGA TATGCTCAAA AACGGTATCA ATGTGGCTTT GGGAACTGAT GGTGCCGCAA GCAACAATAA TCTTAACATG TTTGAAGAAA TGCATTTGGC GGCGCTGATA CACAAAGGGG TTCACATGGA TCCCACATTG ATTGGTGCTT CCTGTGCATT AAAGATGGCA ACCGTAAACG GAGCAAAGGC ACTTGGGTTT GGAGGCGAGA TTGGAGAAAT TTCAAAGGGA ATGAAGGCGG ACCTTATCCT TATAGATATG GACAAGACGC ATCTGTGCCC TGTTAACGAC CCTGTTTCGG CCGTGGTATA CTCCGCGCAA AGCTCGGACG TTGACACGGT AATAATTGAC GGCAATATTG TGATGGAAAA CAGAGAGCTT AAGACCATAG ATGAGGAAAA AGTAAAATTT AATGTTAAGG AAATTGCCAA AAGAGTATTG AGATAA
|
Protein sequence | MNILIKNADI ITCNASDDVL QGAFLGIKDG YIDFIDTKED ALKDFKADRI IDAKGKLVMP GLVNAHTHSG MTILRNFAND LALEDWLFGN VLPVEEKLTP EDIYWGTLLG IAEMIKSGTT TFADMYLHME EVARAVSETG IRANLCRSPL KDSDKSVEDA VRCFEYFKKW DNSFNGRIKV YIEVHSVYLF DEPSLRMSAE VAKEINTGIH IHVQETLKEC EDSNKKYGMS PAEICCKTGI FDVPVIAAHC VHLSDGDMGI IRDKGVNVIH NPTSNLKLGS GIAKVDDMLK NGINVALGTD GAASNNNLNM FEEMHLAALI HKGVHMDPTL IGASCALKMA TVNGAKALGF GGEIGEISKG MKADLILIDM DKTHLCPVND PVSAVVYSAQ SSDVDTVIID GNIVMENREL KTIDEEKVKF NVKEIAKRVL R
|
| |