Gene Achl_0672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_0672 
Symbol 
ID7292102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp713813 
End bp714844 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content64% 
IMG OID643589069 
Product2OG-Fe(II) oxygenase 
Protein accessionYP_002486758 
Protein GI220911449 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACACG ACCAGGGAGC CATACCTGTT CTGGATTTGA GTACCGCACG GCAGCCCTAC 
GGGACCTTCA GTCCGGAATT CATCGAGCAG TTGCGGCACG CCACCCATGA CGTGGGCTTC
TTCCAGATCA CGGGCTACGG GGGTTCGCCG GGGCAGGCGG ACCAACTCCT TGACGCTGTC
CGGCGGTTCT TCAACCTTCC CCTTGAAGAA CGGATGAAAC TGGACAACCG GCTTTCTCCA
CACTTCCGCG GCTACACCCG GATGGGAACC GAAGTGACGC AGGGGCGGGC GGATGCGCGG
GAGCAGATCG ACTACTCTCC CGAGCGCCCG CCGGTAAGCA GCTACCCGCC GGACCAGCCG
TACTGGCTGC TGCAGGGACC AAACCAGTGG CCGGACGAAG CGTTCCCTGA ACTGAAGCCG
GCAGCCATGG CCTGGGCCGA GCTGATGTCC GCGGTGGGGA TGGAACTGCT GCGCGCCATT
GCGGTGACGC TGCAACAACC CGAGGACTAT TTCGACGAAC CGTTCCGGGA AGCACCGGCA
TGGATGGGCA AATTGGTCCA TTATGTTGGC GGCGTGGTCA AAGAGGCAGG TAACCAGGGG
GTGGGTTCCC ATGCTGACTA CGGGTTCGTG ACACTCCTGC TGCAGGACGA CGTTGGAGGC
CTGGAAGTAA AGCCGCCGGG GACCTCGGAA TGGCTTCCGG TGGAGCCCCT GCCCGGCGCG
TTGGTGGTGA ACCTCGGCGA AATGCTGGAA GTGGCCACCG AGGGATACCT TGCGGCCACG
ATCCACCGCG TGCAGGCACC GCCTCCGGGT GTGGACCGCT ATTCGGTGCC GTTCTTCTGG
TCGCCCCGCT TGGACTCAGT CATCCAGCCT GTTCCGCTGG CCCCGGAGTT GAAGGCCGCC
GCACGCGGCA TTACGGACGA TCCCGGCAAC CCGTTGCTCG CATCCTTTGG CCTCAACATG
CTCAAGGGCA GAATGCGGGC GCACCCGGAC GTCACCGAGC GGCATTACCC GGACCTGCTG
AAGCGGAGCT AG
 
Protein sequence
MSHDQGAIPV LDLSTARQPY GTFSPEFIEQ LRHATHDVGF FQITGYGGSP GQADQLLDAV 
RRFFNLPLEE RMKLDNRLSP HFRGYTRMGT EVTQGRADAR EQIDYSPERP PVSSYPPDQP
YWLLQGPNQW PDEAFPELKP AAMAWAELMS AVGMELLRAI AVTLQQPEDY FDEPFREAPA
WMGKLVHYVG GVVKEAGNQG VGSHADYGFV TLLLQDDVGG LEVKPPGTSE WLPVEPLPGA
LVVNLGEMLE VATEGYLAAT IHRVQAPPPG VDRYSVPFFW SPRLDSVIQP VPLAPELKAA
ARGITDDPGN PLLASFGLNM LKGRMRAHPD VTERHYPDLL KRS