Gene Achl_3520 details


Gene Information

Locus tag: Achl_3520
Symbol:
ID: 7295001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 3897838
End bp: 3899124
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 71%
IMG OID: 643591926
Product: allantoate amidohydrolase
Protein accession: YP_002489565
Protein GI: 220914256
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01879] amidase, hydantoinase/carbamoylase family
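
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated in the record itself, and the function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in one untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3897838, 3899124)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the coordinates give 1287 bp, and 1287 / 3 = 429 codons, one of which is the stop codon, which agrees with the listed protein length of 428 aa.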


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0261645
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCTTC CCCAGACAGC ACCCGCACCG CCTGCCGTAC CCGCAGAAAC CGCCGGCGCA 
CCCACCGTTG CCGGCCTCCT GAAGGAAATC TCCGACGTCG GGCGTGACAG GACCCGCGGC
GGCTACTCCC GCCCGGTGTT CTCCACCGCC GAAACGGACC TGCGGATCTG GTTCATCGAG
CGGGCCACCC GGCGTGGGCT GGACGTCCAC ACCGACGCCA ACGGCATCAT CTGGGCCTGG
TGGGACACGG CCGCGGGTGT GCGGAAGGAC GCGGTGGCCA CCGGCAGCCA CCTCGATTCC
GTCCCCGGCG GCGGCGAGTA TGACGGCCCC CTGGGGGTCG CCTCGGCACT GGTGGCCGTC
GACCTCCTCA AAGCACGCAA CTTCCGCCCG CGCCGCCCCC TGGCGATCGC AGTGTTTCCC
GAGGAGGAAG GCTCGCGGTT CGGCATCGCC TGCCTTGGCT CGCGGCTCCT CACCGGCGAA
CTCGATCCCA ACAAGGCCCG CAACCTCCGC GACCCGGACG GCAACACCTA CGCCGACGTC
GCAGCGGCCA ACGGACAGGA CCCGCGGTTC ATCGGCGCCG ACTACAAGGC GCTGCAGCAG
CTGGGCCTGT TCGTTGAACT GCACGTGGAA CAGGGGAGGG GCCTGATCGA CCTGGACCAG
CCGGTGGCGG TTGGTTCGTC CATCCTGGGC CACGGCCGCT GGAAACTGGC CATCTCCGGC
GAGGGAAACC ACGCAGGCAC CACACTGATG CAGGACCGCA GGGACCCCAT GATCGCGGCC
GCCAAAGTGG TGGTGGGCAT CCGTGAGACC GCCCGCAAGT ACCGGGACGC CCGTGCCACG
GTGGGCCGGC TGCAACCCGT CCCCGGCGGC ACCAACGTCA TCGCGTCCCG CGTGGACCTG
TGGATCGACG TCCGCCACCC GGAGGACTCC GTCACCGCCG CGCTGGTGGA GGCCATCGGG
CTGAACGCCC AGGTCCTCGC CGCCGAGGAA GGCTGTTCCG CCGCCCTCAC CAGGGAGTCG
CTGAGCCCCA CAGTGCAGTT CGACGACGGA CTCCGGGACC GGCTGCAGCA GCTCCTTCCT
GCCGCTCCCG TGCTGGCCAC CGGTGCAGGG CACGACGCCG GGGTGCTGGC GGCGCACCTG
CCCACGGCCA TGCTGTTCGT CCGCAACCCC ACGGGCATCT CGCATTCGCC CGACGAACTG
GTGGAGGACC GGGACGCCGA AGCCGGCGCC CTTGCCCTGG CGGACTCCCT GGCCGGGCTC
CTGGGCGGGG CCCGTACCGT TGGCTAG
 
Protein sequence
MSLPQTAPAP PAVPAETAGA PTVAGLLKEI SDVGRDRTRG GYSRPVFSTA ETDLRIWFIE 
RATRRGLDVH TDANGIIWAW WDTAAGVRKD AVATGSHLDS VPGGGEYDGP LGVASALVAV
DLLKARNFRP RRPLAIAVFP EEEGSRFGIA CLGSRLLTGE LDPNKARNLR DPDGNTYADV
AAANGQDPRF IGADYKALQQ LGLFVELHVE QGRGLIDLDQ PVAVGSSILG HGRWKLAISG
EGNHAGTTLM QDRRDPMIAA AKVVVGIRET ARKYRDARAT VGRLQPVPGG TNVIASRVDL
WIDVRHPEDS VTAALVEAIG LNAQVLAAEE GCSAALTRES LSPTVQFDDG LRDRLQQLLP
AAPVLATGAG HDAGVLAAHL PTAMLFVRNP TGISHSPDEL VEDRDAEAGA LALADSLAGL
LGGARTVG
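
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG is among the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of how the GC content and the translation could be recomputed from the sequence text. It is not the method used to produce this record; the helper names and the `START_CODONS` set are illustrative assumptions, and only the first 60 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    The first codon is read as M when it is one of the assumed start codons.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_60 = "GTGAGCCTTC CCCAGACAGC ACCCGCACCG CCTGCCGTAC CCGCAGAAAC CGCCGGCGCA"
print(translate_cds(first_60))  # MSLPQTAPAPPAVPAETAGA
print(round(gc_fraction(first_60), 2))
```

Passing the full 1287-base gene sequence to the same functions would allow the listed GC content and the complete 428-residue protein sequence to be checked in the same way.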