Gene Information |
Locus tag | Nther_1599 |
Symbol | |
ID | 6315672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1675337 |
End bp | 1676641 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642643970 |
Product | Atrazine chlorohydrolase |
Protein accession | YP_001917761 |
Protein GI | 188586216 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
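The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 1675337
end_bp = 1676641
reported_gene_length = 1305     # bp
reported_protein_length = 434   # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # length matches coordinates?
print(protein_length == reported_protein_length)  # protein length matches CDS?
```

Under these assumptions the three fields (coordinates, gene length, protein length) agree with one another.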
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.748458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 4.43454e-23 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGACTAAAG TTACATTAAT TGAAAATGCA GTTGTCATTA CTATGGACAA GGAGGATATT TTAAACCCGG GATATATTCT CATTGAAGGC GACAAAATAA AAAAAGTCAG TTCTGAACCC TTTGATGAAG AAACCAAGAA AACTGCTCAC GAAATTATTG ATGCCCAAGG AAACTTCGTT TTACCAGGAC TAATAAACAC CCACACACAT GCAGGAATGA GTTTATTAAG AAGTTACGCG GATGATATGT CATTGATGGA TTGGCTAGAA AATAAAATTT GGCCTGCAGA AGGAAACTTA ACAAGTGAAA GTATTTACTG GGGTACATTA CTATCTATAG TAGAAATGAT TAAAAGTGGA ACCACTACCT TTGCTGATAT GTACTTTTTT ATGGATGATG TGGCTAGAGC CGTTCAAGAA AGTGGTATGA GAGCTTCACT ATCCAGAGGG ATGATTGGAT TTAAAGGTGA TGACAGCTTA TATGAGGCTA AAGAATTTGT TAAGAAATGG CATAATGGCG CAGAAGGAAG AATAACTTGT ATGCTAGGAC CTCATGCTCC GTATACATGT CCTCCTGAAT TTTTGAACAA AACTTTGTCA ATGGCTCATG AACTTGAAAT GCCTATACAT ATCCACCTTT CAGAAACGGA AGGTGAAGTT ACAGATAACT ATAAAGAATA TAATAAATCA CCAGTAGAGC ATCTTAATGA GTTAGGAATA TTTGATGTTC CAACTTTAGC TGCTCACTGC GTACATGTTA ACGATGAAGA TATCAGGATA TTAGCGGATA ATAACGTCTC TGTCAGTCAC AACATTGGCA GTAATTTAAA ATTGGGATCT GGAATTGCAC CAATTGATAA AATGTTATCA GAAAATGTAA CAGTTTCTTT AGGGACCGAT GGAGCTTCAT CCAATAATAA TCTCGACCTT TTAGAAGAAG TTAGACTAAG TTCTTTGGTA CAAAAAGGAT TTCATGAAAA TCCAACTTTA ATTAATGCTT ACACTGCTTT AGAAATGGCA ACAATAAAAG GAGGAGAAAC TCTAAAATTA CCAGAGGTAG GTAAGTTAGC TCCCGAATAT AAGGCAGATA TTATTGTAAT TGATAAGAAC TCAGCCGAGT TATATCCAAG ACACGATCCT ATCGCTAATA TTGTTTATTC ATGTAATTCT AATAATGTTT CAACTGTAAT AATTGATGGG AAAATAGTAA TGAAAGACGG AAATTTACAA ACTATCGATG AAGAAAAAGT TTATCATGAA GCTGATAAAC ATGCAAAAAT GATTACTGAT CAAGATAAGA ATTAA
|
Protein sequence | MTKVTLIENA VVITMDKEDI LNPGYILIEG DKIKKVSSEP FDEETKKTAH EIIDAQGNFV LPGLINTHTH AGMSLLRSYA DDMSLMDWLE NKIWPAEGNL TSESIYWGTL LSIVEMIKSG TTTFADMYFF MDDVARAVQE SGMRASLSRG MIGFKGDDSL YEAKEFVKKW HNGAEGRITC MLGPHAPYTC PPEFLNKTLS MAHELEMPIH IHLSETEGEV TDNYKEYNKS PVEHLNELGI FDVPTLAAHC VHVNDEDIRI LADNNVSVSH NIGSNLKLGS GIAPIDKMLS ENVTVSLGTD GASSNNNLDL LEEVRLSSLV QKGFHENPTL INAYTALEMA TIKGGETLKL PEVGKLAPEY KADIIVIDKN SAELYPRHDP IANIVYSCNS NNVSTVIIDG KIVMKDGNLQ TIDEEKVYHE ADKHAKMITD QDKN
|
| |
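The sequence fields above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, summarised for GC content, and translated is given below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database API. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model). The gene sequence is assumed to be listed in coding orientation already, as its leading ATG suggests, so no reverse complement is taken despite the minus strand.

```python
BASES = "TCAG"
# Standard amino-acid assignments in NCBI TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence field above.
print(translate("ATGACTAAAG TTACATTAAT TGAAAATGCA"))
```

Passing the full gene sequence field to `translate` and `gc_fraction` would allow a comparison with the protein sequence and GC content fields reported in the record.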