Gene Nther_1599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1599 
Symbol 
ID6315672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1675337 
End bp1676641 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content34% 
IMG OID642643970 
ProductAtrazine chlorohydrolase 
Protein accessionYP_001917761 
Protein GI188586216 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.748458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.4345400000000004e-23 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACTAAAG TTACATTAAT TGAAAATGCA GTTGTCATTA CTATGGACAA GGAGGATATT 
TTAAACCCGG GATATATTCT CATTGAAGGC GACAAAATAA AAAAAGTCAG TTCTGAACCC
TTTGATGAAG AAACCAAGAA AACTGCTCAC GAAATTATTG ATGCCCAAGG AAACTTCGTT
TTACCAGGAC TAATAAACAC CCACACACAT GCAGGAATGA GTTTATTAAG AAGTTACGCG
GATGATATGT CATTGATGGA TTGGCTAGAA AATAAAATTT GGCCTGCAGA AGGAAACTTA
ACAAGTGAAA GTATTTACTG GGGTACATTA CTATCTATAG TAGAAATGAT TAAAAGTGGA
ACCACTACCT TTGCTGATAT GTACTTTTTT ATGGATGATG TGGCTAGAGC CGTTCAAGAA
AGTGGTATGA GAGCTTCACT ATCCAGAGGG ATGATTGGAT TTAAAGGTGA TGACAGCTTA
TATGAGGCTA AAGAATTTGT TAAGAAATGG CATAATGGCG CAGAAGGAAG AATAACTTGT
ATGCTAGGAC CTCATGCTCC GTATACATGT CCTCCTGAAT TTTTGAACAA AACTTTGTCA
ATGGCTCATG AACTTGAAAT GCCTATACAT ATCCACCTTT CAGAAACGGA AGGTGAAGTT
ACAGATAACT ATAAAGAATA TAATAAATCA CCAGTAGAGC ATCTTAATGA GTTAGGAATA
TTTGATGTTC CAACTTTAGC TGCTCACTGC GTACATGTTA ACGATGAAGA TATCAGGATA
TTAGCGGATA ATAACGTCTC TGTCAGTCAC AACATTGGCA GTAATTTAAA ATTGGGATCT
GGAATTGCAC CAATTGATAA AATGTTATCA GAAAATGTAA CAGTTTCTTT AGGGACCGAT
GGAGCTTCAT CCAATAATAA TCTCGACCTT TTAGAAGAAG TTAGACTAAG TTCTTTGGTA
CAAAAAGGAT TTCATGAAAA TCCAACTTTA ATTAATGCTT ACACTGCTTT AGAAATGGCA
ACAATAAAAG GAGGAGAAAC TCTAAAATTA CCAGAGGTAG GTAAGTTAGC TCCCGAATAT
AAGGCAGATA TTATTGTAAT TGATAAGAAC TCAGCCGAGT TATATCCAAG ACACGATCCT
ATCGCTAATA TTGTTTATTC ATGTAATTCT AATAATGTTT CAACTGTAAT AATTGATGGG
AAAATAGTAA TGAAAGACGG AAATTTACAA ACTATCGATG AAGAAAAAGT TTATCATGAA
GCTGATAAAC ATGCAAAAAT GATTACTGAT CAAGATAAGA ATTAA
 
Protein sequence
MTKVTLIENA VVITMDKEDI LNPGYILIEG DKIKKVSSEP FDEETKKTAH EIIDAQGNFV 
LPGLINTHTH AGMSLLRSYA DDMSLMDWLE NKIWPAEGNL TSESIYWGTL LSIVEMIKSG
TTTFADMYFF MDDVARAVQE SGMRASLSRG MIGFKGDDSL YEAKEFVKKW HNGAEGRITC
MLGPHAPYTC PPEFLNKTLS MAHELEMPIH IHLSETEGEV TDNYKEYNKS PVEHLNELGI
FDVPTLAAHC VHVNDEDIRI LADNNVSVSH NIGSNLKLGS GIAPIDKMLS ENVTVSLGTD
GASSNNNLDL LEEVRLSSLV QKGFHENPTL INAYTALEMA TIKGGETLKL PEVGKLAPEY
KADIIVIDKN SAELYPRHDP IANIVYSCNS NNVSTVIIDG KIVMKDGNLQ TIDEEKVYHE
ADKHAKMITD QDKN