Gene Anae109_3485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3485 
Symbol 
ID5374549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4091846 
End bp4093003 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content74% 
IMG OID640845009 
Producthomogentisate 12-dioxygenase 
Protein accessionYP_001380652 
Protein GI153006327 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.306053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACC GGATCGTGCA GGGCGCGGTC CCGCGGAAGC ACCACATCGC CTTCCGCGAT 
CCCGACGGGC GCCTCCTGCA CGAGGAGGCC TTCACGCGCG CCGGGTTCGA CGGTGCGTAC
ACCCTCGCCT ACCACCGGCA CCGGCCCCAC GCGACGCACG CCGCCGAGGT CCGTCACGGC
TGGACGCTGC CGCGGGCGGC GCCGCCGCGC GGTCTCCTGA AGCGCCACTA CCGCACGCAG
GAGCTCCCCC TGCCCGCGGG GCCGGCGGTG GACGCGCGCG TCCCGCTCCT CTTCAACGCG
GACGTGGTCG TCGGCCTCGC CACGCCGAGC GCGGAGGATC CGGTCTACCT CTCGAACGGC
GACGGCGACG ACCTCTTCTT CGTCCTCGAG GGAGGCGGGC TCTTGCGCAC GCCGCTCGGC
GACCTGCGCT TCGCGCAGGA CGACTACGTC TACGTGCCGA AGGGGCTCCT CCACCGCTTC
GTGCCGGGCG CGGGTCCGCA GCGCTGGCTC TCCCTCGAGT TCCCGGGCGG CCTCCACCTG
CCCTCCCAGT GGCGCAACGA GACCGGCCAG CTCCGCATGG ACGCGCCCTA CTCCCACCGC
GACTTCCGCC GCGTCGAGTG GACGGGCCCG CTCGACGAGG GGATCCGCGA GCTCCTCGTC
AAGCGCGCCG GGGCTTTCCA CGCCTTCCGC TACGACGAGT CGCCGCTCGA CGTGGTGGGC
TGGGACGGCG CCGTGTACCC GTTCGCCTTT CCCATCCTGA ACTTCCAGCC GCGCGCCGGG
CTCGTGCACC TGCCGCCCAC CTGGCACGGC ACCTTCGCCG CCCGCGGCGC GCTGGTGTGC
TCGTTCGTCC CGCGCGTGGT GGACTTCCAC CCCGAGGCGA TCCCCTGCCC CTACCCGCAC
GCCTCGCCGG ACGTGGACGA GATCCTGTTC TACGTCCGCG GGGAGTTCAC CTCCCGGCGC
GGCGTGGGCC CCGGCTCGAT CTCGCACCAC CCCGCGGGCG TGATGCACGG GCCGCACCCG
GGCGCCTACG AGGGCTCGAT CGGCGCCCGC ACCACGAGCG AGCTCGCGGT CATGCTCGAC
TGCTACCTCC CGCTCGCCGC GACCCCCGCC GCGCTGGGGA TCGAGGACCC CGGCTACCAG
GAGAGCTTCG TGCGCTGA
 
Protein sequence
MLDRIVQGAV PRKHHIAFRD PDGRLLHEEA FTRAGFDGAY TLAYHRHRPH ATHAAEVRHG 
WTLPRAAPPR GLLKRHYRTQ ELPLPAGPAV DARVPLLFNA DVVVGLATPS AEDPVYLSNG
DGDDLFFVLE GGGLLRTPLG DLRFAQDDYV YVPKGLLHRF VPGAGPQRWL SLEFPGGLHL
PSQWRNETGQ LRMDAPYSHR DFRRVEWTGP LDEGIRELLV KRAGAFHAFR YDESPLDVVG
WDGAVYPFAF PILNFQPRAG LVHLPPTWHG TFAARGALVC SFVPRVVDFH PEAIPCPYPH
ASPDVDEILF YVRGEFTSRR GVGPGSISHH PAGVMHGPHP GAYEGSIGAR TTSELAVMLD
CYLPLAATPA ALGIEDPGYQ ESFVR