Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1148 |
Symbol | |
ID | 6315713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1213863 |
End bp | 1215200 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642643520 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_001917319 |
Protein GI | 188585774 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.356182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAA CTTTAATACA AAATGGCTTG TTAGTTACGA TGAATAAAGA TAGAGAGATA TATACAGGAG ATATTCTAAT AAAAGACAAT AAAATATCAA AAATAAGCAG TGAAAGCATT TCAACCAATG TGGACCAAGT GATTGATGCT ACAGACAAAG TTATTATTCC TGGTATGATA CAACCCCATG TACATCTTAC TCAAACACTT TTTAGAGGCC AAGCCGATGA TTTAGAACTT CTTGATTGGT TAAAAAATAG AATATGGCCT TTAGAGGGGG CTCATACGGA TCAATCTAAT TATATTTCCG CATATCTGGG AATTGCAGAA CTGATAAAAG GTGGAACAAC TTCAATTATC GATATGGAAA CAGTCCATCA TACTGAAGCT GCCTTAAAAG CCATTTATGA CACAGGTTAT CGAGCTGTTA CCGGTAAATG TATAATGGAC GATGGTGGGG ATATCCCTGA AACATTAAGA GAAACAACTA AAGAATCTAT ACAAGAAAGT GTTAGATTAT TGGAAAAGTG GCACAATCAG GGTAATGGAA GAATTAAGTA CGGATTCGCA CCTAGATTTG CCATATCAAG TAGCCAAAAA GCCCTTTCCC AAGTGAGGGA TTTAGCTCGG GAATACGGAG TATTAATTCA TACTCATGCT TCTGAAAATC AATATGAAAC CAGCTTAGTT GAAGAAAAAA CAGGTTTACG AAATGTTAAA TTATTTGAAA AACTAGGTTT AACTGGTGAA GATTTAATAC TGGCTCATTG TATCTGGCTG AATGAAGAAG AAATGGAGAT CTTAACAAGT ACTGGTACTA AAATTGTACA CTGTCCTAGT TCTAATTTAA AACTTGCTTC AGGAATTGCT AAAATACCTG ATTTGTTAAA AATGGGGGCG AATGTCTCTT TAGCTTCAGA TGGTGCACCT TGTAACAATA ATATGGATAT GTTTGTTGAA ATGCGAAATG CCGCATTGAT ACATAAAGCA TTTAACTTAG ATCCAACAGT TATAAATGCA GAAAAAGTTT TTGAAATGGC TACTCTAGGT GGTGCCAAGG CCATGGGTAT GGAGGAACAG TTAGGTAGTA TTGAAGAAGG AAAATTAGCT GATTTGGCAA TTGTAGATTT GAATGGGGTA CATGTGGCTC CACGAACAGG TGAGGACGTG ATTGCCAAGT TAGTATATTG TGCTAGGGCA ACGGATGTAA CTACTACGAT TATTGACGGA AAAATTGTAA TGGAGGAACA ACAACTTACC ACTATAGATG AAGAAGCGGT AAAAAAAGAG GCCAATAAAC TTTTAGATAA TCAAATCAAA AGAGCAGGCT TGGATTAG
|
Protein sequence | MTTTLIQNGL LVTMNKDREI YTGDILIKDN KISKISSESI STNVDQVIDA TDKVIIPGMI QPHVHLTQTL FRGQADDLEL LDWLKNRIWP LEGAHTDQSN YISAYLGIAE LIKGGTTSII DMETVHHTEA ALKAIYDTGY RAVTGKCIMD DGGDIPETLR ETTKESIQES VRLLEKWHNQ GNGRIKYGFA PRFAISSSQK ALSQVRDLAR EYGVLIHTHA SENQYETSLV EEKTGLRNVK LFEKLGLTGE DLILAHCIWL NEEEMEILTS TGTKIVHCPS SNLKLASGIA KIPDLLKMGA NVSLASDGAP CNNNMDMFVE MRNAALIHKA FNLDPTVINA EKVFEMATLG GAKAMGMEEQ LGSIEEGKLA DLAIVDLNGV HVAPRTGEDV IAKLVYCARA TDVTTTIIDG KIVMEEQQLT TIDEEAVKKE ANKLLDNQIK RAGLD
|
| |