Gene Achl_3544 details

Gene Information

Locus tag: Achl_3544
Symbol:
ID: 7295025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 3930183
End bp: 3931202
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 65%
IMG OID: 643591950
Product: Catechol 2,3-dioxygenase
Protein accession: YP_002489589
Protein GI: 220914280
COG category: [R] General function prediction only
COG ID: [COG2514] Predicted ring-cleavage extradiol dioxygenase
TIGRFAM ID: [TIGR03211] catechol 2,3 dioxygenase
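
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3930183
END_BP = 3931202


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1020 339
```

Under these assumptions the result matches the Gene Length (1020 bp) and Protein Length (339 aa) fields.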


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAACTC CCCTCTCCCA TCTTGCCCAC CTTGAGATCA CCACCCCCGA CGTCGAAGCC 
TCGGCCAAGT TCTACGAGGA AAAGTTCGGA ATGCGCATCA TTGACCGGGT GGATGGCAAC
GCCTACCTGC GCTGCTGGGG CGACTACTAC CGTTACAGCC TGGTCATCAC CGAAGGCCCT
GAAGCGTCCC TCGGCCGGAT GGCCTGGCGC ACCAATTCGC AGGCAGCCCT GGAAGCCGCC
GCCCAGCGCA TTGAAACCAC TGGTGTACAG GGCACCTGGA CCGCCGGCGG CCACGGCTAC
GGCAAGGCCT ACGAGTTCAC CGGCCCCTAT GGCCACCACA TGCGCCTGTT CTATGAGGTG
GAAAAGTTCG TGGCCGAGCC CGGCTTCGAG TCCACCTATC CGGACCGTCC CGAGCGTCGC
AGCAGCCACG CGGCCGCCCC GCGATTCCTG GACCACGTCA CCGTCGCCAC GCAGGACGTC
CGCGGCTTTG CCAAGTGGTA CAACGAGGCC CTCGGCTTCC GCGTCATGGC ATTCGTGGAC
CTGGACGAAG CCCCCATCAC GGTCTTCTCG GTCCTGACCA CCAACGAAAA GTCCCACGAC
CTCGGCGTCG TCCTGGACAC CTCCAACCGC CCCGGCCGCG TCAACCACAT TGCCTTCTGG
GTAGATGCCA CGGAGGACCT GCTCCGCACC GCCGACGTCA TGATGGAGAA CGGGACCCCC
ATGGAATATG GCCCCTCCAT CCACGGCGTG GGCGAGCAGA ACTTCCTCTA CTTCCGTGAC
CCCTCCGGCC TGCGCGTCGA GCTGAACTCC GGCGGCTACC GCAACTACGT TCCGGACTGG
GAGGCCAACA CCTGGAAGCC GTCCGAGGGC TCCAATAACT TCTACAAGAA CGGCGCCATG
CCGCACTCCA TGACCGAGTC CTTCCCGCCG GCCGAAGGTT TCACCGCCAC TGAAGAGGGC
GCCTCCCCGG AAATGAAGGA AGCACTCCTG AACCCCTACG CCCAGCAGGG CCGGGGCTAA
 
Protein sequence
METPLSHLAH LEITTPDVEA SAKFYEEKFG MRIIDRVDGN AYLRCWGDYY RYSLVITEGP 
EASLGRMAWR TNSQAALEAA AQRIETTGVQ GTWTAGGHGY GKAYEFTGPY GHHMRLFYEV
EKFVAEPGFE STYPDRPERR SSHAAAPRFL DHVTVATQDV RGFAKWYNEA LGFRVMAFVD
LDEAPITVFS VLTTNEKSHD LGVVLDTSNR PGRVNHIAFW VDATEDLLRT ADVMMENGTP
MEYGPSIHGV GEQNFLYFRD PSGLRVELNS GGYRNYVPDW EANTWKPSEG SNNFYKNGAM
PHSMTESFPP AEGFTATEEG ASPEMKEALL NPYAQQGRG
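
The gene sequence begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiator codon under translation table 11. The following is a minimal sketch of how the two sequences relate, not the pipeline used to produce this record. It assumes the amino acid assignments of table 11 equal those of the standard code, treats only ATG, GTG and TTG as initiators (a subset of the table 11 start codons), and uses hypothetical helper names `translate` and `gc_percent`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Drop spaces and line breaks used for display, and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna, initiator=True):
    """Translate a CDS up to the first stop codon.

    If initiator is True and the first codon is a start codon, it is read as M.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("GTGGAAACTC CCCTCTCCCA TCTTGCCCAC"))  # METPLSHLAH
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared against the nucleotide sequence.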