Gene Information |
Locus tag | Moth_0963 |
Symbol | |
ID | 3831238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 994889 |
End bp | 996175 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828892 |
Product | amidohydrolase |
Protein accession | YP_429821 |
Protein GI | 83589812 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
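The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the record states neither convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 994889
END_BP = 996175
GENE_LENGTH_BP = 1287
PROTEIN_LENGTH_AA = 428


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed convention)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the ones reported in the record.
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 994889..996175 covers 1287 bp, which is 429 codons, or 428 residues once the stop codon is dropped, in agreement with the table.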
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000000377976 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCAAGA TTTTAATCAA GGATTGTACC ATCGTTCCGA TAAGCGGTCC GGTAATTGGG AAGGGGGTTA TCGCCATTAA TGACGACCGC CTCCATTATG TCGGCCCGGC AGGCGGTTTG CCGGCCGGCT GGCAGGCGGA TACCGTAATT GATGCCGGCG ATATGGTGGC CCTGCCGGGC CTGGTCAATG CCCATACCCA CGCCGCCATG ACCCTGCTGC GGAGCTACGC CGACGACCTG CCACTTAAAC AATGGTTGGA AGAAAAAATC TGGCCCCGCG AGGACCGGCT GGAGCGGGAA GATATCTACT GGGGTAGCAA AATCGCCCTG CTGGAGATGA TCCGCTCCGG CACCACCACC TTTGCCGATA TGTATTTCCA TATGGATGCC GTGGCCGGGG CAGTGGTGGA AGCCGGGCTG CGGGCCAGCC TTTGCCAGGG ATTGATAGGC CTTCAGGATA CCAGCAACAA ACGGCTGGAA GCAGGCATCA GTATGGTAAA AGAGTGGCAT GGCGCTGGTG AGGGACGGAT TACCACCATG CTGGGTCCCC ACGCCCCCAA TACCTGCACG CCGGAGTATT TAACCAGGGT TGCCGAGACA GCGGCCGGGC TGGGGGTGGG GTTACATATC CACCTGGCGG AAACGCGGGG AGAGGTCGAA GACGTAAAAG CCCGTTACGG GGCTACCCCG GTAGCCCTGG TCAACAAGCT GGGTCTCCTG GACCTGCCAG TCCTGGCGGC CCACTGCGTC CACCTGACCA CCGAGGAAAT CGCTATCCTG GCCGAGAAAA AGGTCGGCGT GGCCCACTGC CCGGAGAGTA ATCTTAAACT GGCCAGCGGC GTGGCCCCGG TAAAGGAGAT GCTGGCTGCC GGCGTCAACG TGGCCATCGG CACTGACGGC GCCTCCAGCA ACAATAACCT GGACATGGTG GCTGAGACCC GGACGGCGGC CCTGCTGGCT AAAGGCATTA CCGGCGACCC CACGGTAGTA CCCGCCCACC AGGCCCTGGT TATGGCCACC CTGAACGGTG CCCGGGCCCT GGGCCTGGAA AAGGAGATCG GCACCCTGGA AGCGGGTAAA AAGGCCGACC TGATCCTGGT GGACATGCGC CAGCCCCACT TGATGCCGCC CAACGATGTC GAGGCCAACC TGGTTTACGC CGCCCGGGGA AGCGACGTGG ATACGGTTAT CGTTAACGGG AAGATTCTCA TGGCCAGGGG TGAGGTTAAG ACCCTGGATG CCGAAGAGAT TTACGCCCAG GTGACGAAGA GGATGCACAA AGGGTAA
|
Protein sequence | MGKILIKDCT IVPISGPVIG KGVIAINDDR LHYVGPAGGL PAGWQADTVI DAGDMVALPG LVNAHTHAAM TLLRSYADDL PLKQWLEEKI WPREDRLERE DIYWGSKIAL LEMIRSGTTT FADMYFHMDA VAGAVVEAGL RASLCQGLIG LQDTSNKRLE AGISMVKEWH GAGEGRITTM LGPHAPNTCT PEYLTRVAET AAGLGVGLHI HLAETRGEVE DVKARYGATP VALVNKLGLL DLPVLAAHCV HLTTEEIAIL AEKKVGVAHC PESNLKLASG VAPVKEMLAA GVNVAIGTDG ASSNNNLDMV AETRTAALLA KGITGDPTVV PAHQALVMAT LNGARALGLE KEIGTLEAGK KADLILVDMR QPHLMPPNDV EANLVYAARG SDVDTVIVNG KILMARGEVK TLDAEEIYAQ VTKRMHKG
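The relationship between the gene and protein sequences can be illustrated with a small translation sketch. It assumes that translation table 11 uses the standard amino acid assignments and that an alternative initiation codon such as GTG is read as methionine at the first position; only a subset of the permitted start codons is listed. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments, in TCAG codon order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Subset of initiation codons accepted here (assumption: not the full list).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
gene_prefix = "GTGGGCAAGA TTTTAATCAA GGATTGTACC"
print(translate(gene_prefix))  # prints MGKILIKDCT
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` should give a value close to the GC content reported in the record, although that has not been verified here.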