Gene Arth_3425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3425 
Symbol 
ID4444155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3855308 
End bp3856672 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content70% 
IMG OID639691249 
Producthydroxydechloroatrazine ethylaminohydrolase 
Protein accessionYP_832900 
Protein GI116671967 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.507053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACCC ACCGCTCCGC ACACAGGCTC TGGATCCGGA ATCCACTCGC AGCCTTCACC 
GCCAACAATC TTGATGCCAC CGGCGGGATC GTGGTGGCCG GCGGCATCAT CACGGAAGTC
CTGGCCGCCG GCCAGCAGCC TTCCGCGCCC TGCCAGGAAA CGTTCGAGGC CGGCAGCCAC
GTCCTGCTGC CGGGCCTGAT CAACACCCAC CACCACTTCT ACCAAACACT CACGCGTGCC
TGGGGTCCGG TGGCCAACGT CCCGCTGTTT CCGTGGCTGC AGAACCTGTA CCCGGTCTGG
GCCCGGCTCA AGCCGCGGGA CCTGGAACTG GCTACCACCG TTGCACTCGC GGAACTGCTG
CTCTCCGGCT GCACCACAGC CGCTGACCAC CACTACCTCT TCCCCCAGGG CATGGAAGAC
GCCATCGACA TCGAGGTCCG GGCGGTGCGG GAGCTCGGCA TGCGGGCCAC GCTCACCCGC
GGCTCCATGA CGCTCGGAGA GGACGACGGC GGCCTGCCGC CACAGTCCAC CGTCCAGCAG
CCGGACGTGA TCCTGGCGGA CAGCGAGCGG CTCATCCGGG AGTATCACGA ACGCGGCGAC
GGCGCCGTCA TCCAGGTTGC CCTGGCCCCG TGCTCGCCGT TCTCCGTGAC CAAGGAGATC
ATGGCCGAGA GCGCCGCACT GGCCGAACGG CATGACGTCC GGCTGCACAC GCACCTGGCT
GAAACGCTGG ACGAGGAAGA CTTCTGCCGG AAGATGTTCG GCCTGCGCAC GGTGGAATAC
CTGGAGAGCG TGGGCTGGCT CGGCAACCGG ACCTGGCTGG GCCACGGCAT CCATTTCAGC
GATGCAGAGA TCGCCGCGCT GGGAGCCGCG GGCACCGCCG TCGCGCACTG CCCCACGTCC
AACATGCGGC TGGCCTCGGG CACTGCCCGG GTACTCGAAC TGGAGGATGC CGGAGTGCCG
GTGGGGCTGG GAGTGGACGG GTCGGCGTCG AACGACGCCT CGAACATGAT CCTGGAGGCA
CGGCAGGCCC TGTACCTGCA GCGGCTGCGC TACGGGGCGC AGGTCCCGGT GGAGCGGGCG
CTGGGCTGGG CGACCCGGGG GTCGGCGGCG GTGCTGGGCC GCTCCGACCT GGGCCAGCTG
GCACCCGGGA TGCAGGCGGA CCTGGCGTTG TTCCGGCTCG ACGAGCTGCG GTTCTCCGGC
AGCCACGACC CCCTCGCCGC GCTCCTGCTG TGCGGAGCGG ACCGGGCCGA CCGGGTGATG
GTGGGCGGGC AGTGGCGCGT GGTGGACGGG CAGATCCCGG GCCTTGATGT TGCCGGGCTG
ATCGCGGAAC ACTCGGCCGC TGCACGGAAG CTGGTGAACG GGTAG
 
Protein sequence
MATHRSAHRL WIRNPLAAFT ANNLDATGGI VVAGGIITEV LAAGQQPSAP CQETFEAGSH 
VLLPGLINTH HHFYQTLTRA WGPVANVPLF PWLQNLYPVW ARLKPRDLEL ATTVALAELL
LSGCTTAADH HYLFPQGMED AIDIEVRAVR ELGMRATLTR GSMTLGEDDG GLPPQSTVQQ
PDVILADSER LIREYHERGD GAVIQVALAP CSPFSVTKEI MAESAALAER HDVRLHTHLA
ETLDEEDFCR KMFGLRTVEY LESVGWLGNR TWLGHGIHFS DAEIAALGAA GTAVAHCPTS
NMRLASGTAR VLELEDAGVP VGLGVDGSAS NDASNMILEA RQALYLQRLR YGAQVPVERA
LGWATRGSAA VLGRSDLGQL APGMQADLAL FRLDELRFSG SHDPLAALLL CGADRADRVM
VGGQWRVVDG QIPGLDVAGL IAEHSAAARK LVNG