Gene Information |
Locus tag | Arth_3598 |
Symbol | |
ID | 4443909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4038478 |
End bp | 4039965 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691422 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_833073 |
Protein GI | 116672140 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
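The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, not part of the original record, that checks them against each other. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4038478
end_bp = 4039965

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute a residue to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1488 495
```

Both results agree with the Gene Length (1488 bp) and Protein Length (495 aa) fields listed in the record.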
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTACACGA AGATCTCGGC CCGGTTCGTT CTGGGGTTCG ACGGAACCCG GCACACGCTC ATCTCCGACG GCGAAGTCGT CTTCGAAGGA GATTCCATCA TCTTCGTGGG TCGCAACTAT GAGGGCCCGG TTGATGAGGA ACGCGACTTT GGCCAGAGCC TGGTGATGCC TGGACTGATC GACCTGGATG CCCTCGCCGA CATCGACCAC CTCATCCTGG ACTCGTGGCC CTCGCCTGAC GTCGCCGCCG GCCACCTGTG GTCGGACGAC TACTTCGCCA GCCGCCGCCG GGACGTGTTC ACGCCGTCGG AACGCGCCAC CATCCGCGAA TTCGCCCTTG CCCAGCTCGC CCTGCACGGC ATTACCACCT ACATGCCCAT CGCCTCCGAG ATCCACAGTT CCTGGGCCGA AGGCTTCGAC GAGCTCGTGG ACATGGCAGA GACCAGCCGA CGGATCGGCC TGCGGGGCTA CCTCGGCCCC GCCTACCGTT CCGGGGTCCA CGTCACCACT GCAGCAGGCA GCCGCGAGGT CCACTTTGAC GAAGCCCGGG GGGTTGCCGG CCTGAGCGAC GCTGAGCGTT TCCTCGACCA TGCCGCCGGG CTAAATGACC CCCTCGTCAC CGGCGTATTG CTGCCGTGCC GCATTGAGAC GCTGTCCGAA AACCTCATGC GGGAGACGGC GAGGATCGCC CGCGATCGGG ACGCGATTGT CCGGCTCCAC TGCCTCCAGT CGCCCCTCGA GGACGAGCTG CTGCAGCGTT CAGCCGGACG CGGCGTCCTG GAACTGCTCG AATCCACCGG CCTGTTCGGC ACCCGCCTTC TCATTCCGCA CGGGGTGGTG ATTAGCGGCA AGGACCCTGC CGCGTCGGCT CCCGGAGGTC CGCTGGACGT GCTGGCCCGG CACGGCGTCA GCATTGTCCA CTGCCCGCTG ACCTCGTTCC GCTACCAGAA GCAGCTCGAT TCGTTCGACC GTTTCCGCGA GGCCGGAATC AATATGTGCC TGGGTACCGA CTCTTTCCCG CCGGACCTGG TGCGGGGCAT GGACGTGGGC ATGCACCTGA CCCGGATGGT TGAGGGGAGG GCCGACGCCG GGACCCTGGC CGACTACTTT GACGCCGCGA CCCTCGGCGG CGCGCGAGCT CTTGGCCGGA AAGACCTCGG GAGGCTGGCG CCCGGCATGC AGGCCGACAT CACGGTGTTC TCCCTGGGCC ACTTCGGCGA CGGCGTCGTC GAGGATCCGC TTCGCACCCT GGTGCTCAAC GGCACGGCGC GGCAGGTTAC GGATACTTTT GTGGCCGGCC GCCCCGTGGT GGTGGACGGT GCCCTGCCCG GCGTTGACCT GGACGCGCTG CGGGCCGGGG GTCAGGGGCT GTTCGAGGCG ATGCGGGCTG CCTACTCGGA GCGGGACGTC CGGCGCCGTG CTTCCGATGA GCTGTTTCCG CCCACCTATC CGCACGCCGA AATTGGCCGT CCGGTTGTTG TCCCGTAG
|
Protein sequence | MYTKISARFV LGFDGTRHTL ISDGEVVFEG DSIIFVGRNY EGPVDEERDF GQSLVMPGLI DLDALADIDH LILDSWPSPD VAAGHLWSDD YFASRRRDVF TPSERATIRE FALAQLALHG ITTYMPIASE IHSSWAEGFD ELVDMAETSR RIGLRGYLGP AYRSGVHVTT AAGSREVHFD EARGVAGLSD AERFLDHAAG LNDPLVTGVL LPCRIETLSE NLMRETARIA RDRDAIVRLH CLQSPLEDEL LQRSAGRGVL ELLESTGLFG TRLLIPHGVV ISGKDPAASA PGGPLDVLAR HGVSIVHCPL TSFRYQKQLD SFDRFREAGI NMCLGTDSFP PDLVRGMDVG MHLTRMVEGR ADAGTLADYF DAATLGGARA LGRKDLGRLA PGMQADITVF SLGHFGDGVV EDPLRTLVLN GTARQVTDTF VAGRPVVVDG ALPGVDLDAL RAGGQGLFEA MRAAYSERDV RRRASDELFP PTYPHAEIGR PVVVP
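The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch, under stated assumptions, of how such a blocked string could be joined and its GC fraction computed. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as a sample, so the result is not expected to match the 67% reported for the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, copied from the record.
sample = "TTGTACACGA AGATCTCGGC CCGGTTCGTT"
sample_seq = join_blocks(sample)
sample_gc = gc_fraction(sample_seq)

print(len(sample_seq), round(sample_gc, 3))
```

Passing the full gene sequence through the same two helpers would give a value to compare against the GC content field.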