Gene Arth_3598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3598 
Symbol 
ID4443909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4038478 
End bp4039965 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content67% 
IMG OID639691422 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_833073 
Protein GI116672140 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTACACGA AGATCTCGGC CCGGTTCGTT CTGGGGTTCG ACGGAACCCG GCACACGCTC 
ATCTCCGACG GCGAAGTCGT CTTCGAAGGA GATTCCATCA TCTTCGTGGG TCGCAACTAT
GAGGGCCCGG TTGATGAGGA ACGCGACTTT GGCCAGAGCC TGGTGATGCC TGGACTGATC
GACCTGGATG CCCTCGCCGA CATCGACCAC CTCATCCTGG ACTCGTGGCC CTCGCCTGAC
GTCGCCGCCG GCCACCTGTG GTCGGACGAC TACTTCGCCA GCCGCCGCCG GGACGTGTTC
ACGCCGTCGG AACGCGCCAC CATCCGCGAA TTCGCCCTTG CCCAGCTCGC CCTGCACGGC
ATTACCACCT ACATGCCCAT CGCCTCCGAG ATCCACAGTT CCTGGGCCGA AGGCTTCGAC
GAGCTCGTGG ACATGGCAGA GACCAGCCGA CGGATCGGCC TGCGGGGCTA CCTCGGCCCC
GCCTACCGTT CCGGGGTCCA CGTCACCACT GCAGCAGGCA GCCGCGAGGT CCACTTTGAC
GAAGCCCGGG GGGTTGCCGG CCTGAGCGAC GCTGAGCGTT TCCTCGACCA TGCCGCCGGG
CTAAATGACC CCCTCGTCAC CGGCGTATTG CTGCCGTGCC GCATTGAGAC GCTGTCCGAA
AACCTCATGC GGGAGACGGC GAGGATCGCC CGCGATCGGG ACGCGATTGT CCGGCTCCAC
TGCCTCCAGT CGCCCCTCGA GGACGAGCTG CTGCAGCGTT CAGCCGGACG CGGCGTCCTG
GAACTGCTCG AATCCACCGG CCTGTTCGGC ACCCGCCTTC TCATTCCGCA CGGGGTGGTG
ATTAGCGGCA AGGACCCTGC CGCGTCGGCT CCCGGAGGTC CGCTGGACGT GCTGGCCCGG
CACGGCGTCA GCATTGTCCA CTGCCCGCTG ACCTCGTTCC GCTACCAGAA GCAGCTCGAT
TCGTTCGACC GTTTCCGCGA GGCCGGAATC AATATGTGCC TGGGTACCGA CTCTTTCCCG
CCGGACCTGG TGCGGGGCAT GGACGTGGGC ATGCACCTGA CCCGGATGGT TGAGGGGAGG
GCCGACGCCG GGACCCTGGC CGACTACTTT GACGCCGCGA CCCTCGGCGG CGCGCGAGCT
CTTGGCCGGA AAGACCTCGG GAGGCTGGCG CCCGGCATGC AGGCCGACAT CACGGTGTTC
TCCCTGGGCC ACTTCGGCGA CGGCGTCGTC GAGGATCCGC TTCGCACCCT GGTGCTCAAC
GGCACGGCGC GGCAGGTTAC GGATACTTTT GTGGCCGGCC GCCCCGTGGT GGTGGACGGT
GCCCTGCCCG GCGTTGACCT GGACGCGCTG CGGGCCGGGG GTCAGGGGCT GTTCGAGGCG
ATGCGGGCTG CCTACTCGGA GCGGGACGTC CGGCGCCGTG CTTCCGATGA GCTGTTTCCG
CCCACCTATC CGCACGCCGA AATTGGCCGT CCGGTTGTTG TCCCGTAG
 
Protein sequence
MYTKISARFV LGFDGTRHTL ISDGEVVFEG DSIIFVGRNY EGPVDEERDF GQSLVMPGLI 
DLDALADIDH LILDSWPSPD VAAGHLWSDD YFASRRRDVF TPSERATIRE FALAQLALHG
ITTYMPIASE IHSSWAEGFD ELVDMAETSR RIGLRGYLGP AYRSGVHVTT AAGSREVHFD
EARGVAGLSD AERFLDHAAG LNDPLVTGVL LPCRIETLSE NLMRETARIA RDRDAIVRLH
CLQSPLEDEL LQRSAGRGVL ELLESTGLFG TRLLIPHGVV ISGKDPAASA PGGPLDVLAR
HGVSIVHCPL TSFRYQKQLD SFDRFREAGI NMCLGTDSFP PDLVRGMDVG MHLTRMVEGR
ADAGTLADYF DAATLGGARA LGRKDLGRLA PGMQADITVF SLGHFGDGVV EDPLRTLVLN
GTARQVTDTF VAGRPVVVDG ALPGVDLDAL RAGGQGLFEA MRAAYSERDV RRRASDELFP
PTYPHAEIGR PVVVP