Gene Nther_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1148 
Symbol 
ID6315713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1213863 
End bp1215200 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content36% 
IMG OID642643520 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_001917319 
Protein GI188585774 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.356182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAA CTTTAATACA AAATGGCTTG TTAGTTACGA TGAATAAAGA TAGAGAGATA 
TATACAGGAG ATATTCTAAT AAAAGACAAT AAAATATCAA AAATAAGCAG TGAAAGCATT
TCAACCAATG TGGACCAAGT GATTGATGCT ACAGACAAAG TTATTATTCC TGGTATGATA
CAACCCCATG TACATCTTAC TCAAACACTT TTTAGAGGCC AAGCCGATGA TTTAGAACTT
CTTGATTGGT TAAAAAATAG AATATGGCCT TTAGAGGGGG CTCATACGGA TCAATCTAAT
TATATTTCCG CATATCTGGG AATTGCAGAA CTGATAAAAG GTGGAACAAC TTCAATTATC
GATATGGAAA CAGTCCATCA TACTGAAGCT GCCTTAAAAG CCATTTATGA CACAGGTTAT
CGAGCTGTTA CCGGTAAATG TATAATGGAC GATGGTGGGG ATATCCCTGA AACATTAAGA
GAAACAACTA AAGAATCTAT ACAAGAAAGT GTTAGATTAT TGGAAAAGTG GCACAATCAG
GGTAATGGAA GAATTAAGTA CGGATTCGCA CCTAGATTTG CCATATCAAG TAGCCAAAAA
GCCCTTTCCC AAGTGAGGGA TTTAGCTCGG GAATACGGAG TATTAATTCA TACTCATGCT
TCTGAAAATC AATATGAAAC CAGCTTAGTT GAAGAAAAAA CAGGTTTACG AAATGTTAAA
TTATTTGAAA AACTAGGTTT AACTGGTGAA GATTTAATAC TGGCTCATTG TATCTGGCTG
AATGAAGAAG AAATGGAGAT CTTAACAAGT ACTGGTACTA AAATTGTACA CTGTCCTAGT
TCTAATTTAA AACTTGCTTC AGGAATTGCT AAAATACCTG ATTTGTTAAA AATGGGGGCG
AATGTCTCTT TAGCTTCAGA TGGTGCACCT TGTAACAATA ATATGGATAT GTTTGTTGAA
ATGCGAAATG CCGCATTGAT ACATAAAGCA TTTAACTTAG ATCCAACAGT TATAAATGCA
GAAAAAGTTT TTGAAATGGC TACTCTAGGT GGTGCCAAGG CCATGGGTAT GGAGGAACAG
TTAGGTAGTA TTGAAGAAGG AAAATTAGCT GATTTGGCAA TTGTAGATTT GAATGGGGTA
CATGTGGCTC CACGAACAGG TGAGGACGTG ATTGCCAAGT TAGTATATTG TGCTAGGGCA
ACGGATGTAA CTACTACGAT TATTGACGGA AAAATTGTAA TGGAGGAACA ACAACTTACC
ACTATAGATG AAGAAGCGGT AAAAAAAGAG GCCAATAAAC TTTTAGATAA TCAAATCAAA
AGAGCAGGCT TGGATTAG
 
Protein sequence
MTTTLIQNGL LVTMNKDREI YTGDILIKDN KISKISSESI STNVDQVIDA TDKVIIPGMI 
QPHVHLTQTL FRGQADDLEL LDWLKNRIWP LEGAHTDQSN YISAYLGIAE LIKGGTTSII
DMETVHHTEA ALKAIYDTGY RAVTGKCIMD DGGDIPETLR ETTKESIQES VRLLEKWHNQ
GNGRIKYGFA PRFAISSSQK ALSQVRDLAR EYGVLIHTHA SENQYETSLV EEKTGLRNVK
LFEKLGLTGE DLILAHCIWL NEEEMEILTS TGTKIVHCPS SNLKLASGIA KIPDLLKMGA
NVSLASDGAP CNNNMDMFVE MRNAALIHKA FNLDPTVINA EKVFEMATLG GAKAMGMEEQ
LGSIEEGKLA DLAIVDLNGV HVAPRTGEDV IAKLVYCARA TDVTTTIIDG KIVMEEQQLT
TIDEEAVKKE ANKLLDNQIK RAGLD