Gene Anae109_4203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4203 
Symbol 
ID5376457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4928753 
End bp4930282 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content72% 
IMG OID640845730 
ProductUbiD family decarboxylase 
Protein accessionYP_001381365 
Protein GI153007040 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTACC GCTCCCTGCG GGAGTTCCTC GATCGGCTGG AGAAGGCGGG CGAGCTCCTC 
CGCGTGAAGG AGCCGGTCGA TCCGGTCCTC GAGATGGCCG CGCTGGCGGA TCGCGCGGCG
AAGCAGGGGG GGCCGGCGCT CCTCTTCGAG AACGTGAAGG GTCGGGCGCC CGGCGGGTTC
CCGGTGGCGA TGAACCTGTT CGGGACGCGG CGCCGGGCGA GCTGGGCGCT CTCCTCGGAG
GACTTCGAGG AGCACGCGCG CGAGCTCCGC TCGCTCCTGC ACATGGCCCC GCCGCAGTCG
CTCTGGGACA AGCTGAAGAT GCTCCCGAAG CTCGGGAAGC TCGCGACGAT GACGCCGAAG
CACGTCTCCT CGGCGGCGAG CCAGGAGGTG GTGCTGCGCG AGCCCGACCT CGGTCAGCTG
CCGGTGCTCA CCACCTGGCC GCACGACGGC GGCCCGTTCG TCACCCTCCC GCAGGTGATC
ACGCGGGATC CGGACACCGG CATCCGCAAC GTCGGCATGT ACCGGCTCCA GGTGCTCGGT
CCCCGGCGGC TCGCCATGCA CTGGCAGCTC CACAAGACCG CGACGGCCCA CTACCGCGGC
TACCGGAAGC GCGGCGAGCC CATGCCGGTC GCGATCGCGC TCGGCGGCGA TCCCGCCCTC
ACCTACTGCG CCTCGGCGCC GCTGCCGCCC AACGTGGACG AGTACCTGTT CGCCGGGTTC
CTGCGCGGCG AGGCGGTGCG GATGACGAAG GGCGTGGCGG TGGACCTCGA CGTCCCCGCC
GACGCGGATC TCGTCATCGA GGGCTACGTG GACACCGCGG CACCCCTCGT GCGCGAGGGG
CCGTTCGGCG ATCACACCGG CTTCTACTCG CTCGCGGACG ACTACCCCGC CCTGGACGTG
GTCGCCGTCA CCCACCGCCG CGGCGCGATC TACCCCGCCA CCGTGGTGGG TCCGCCGCCG
GTCGAGGACC AGTGGCTCGG CAAGGCGACG GAGCGGATCT TCCTGCCGAT GCTGCAGATG
ATCTTCCCGG AGATCGTCGA CATGGCGATG CCGGTGGAGG GCGTGTTCCA CAACCTCTGC
CTCATCTCCA TCAAGAAGGA GCACCCCGGC CAGGCGAAGA AGGTCATCCA CGGTCTGTGG
GGCTCGGGCC AGATGGCGCA GACGAAGACG CTCGTGGTCT TCGACGACGA CGTGGACGTG
CAGGACGTGC CGCAGGCCGC CTGGCGCGCC TTCGCCAACG TGGACGTGAA GCGCGACCTC
GTGATCGCCG ACGGCCCGGT GGACGTGCTC GACCACGCCG CCACGCACTT CGCGTTCGGC
GGGAAGATCG GCGTGGACGC GACGCGGAAG TGGCGGGAGG AGGGCGGGCG CGAGTGGCCG
GAGGTGTGCG TCCACCCGCC CGAGGTGATC GCGCGGATGG ACGCGCTGTA CGAGCGGCTG
GTCCCCGGCG CGGAGCGGCC GCGCCGGCCG CGCATCGCGC CGCCGCCGGC CGCGTCGTGG
AAGCCGCCCA GCGGAGGGAT CCTCCAGTGA
 
Protein sequence
MAYRSLREFL DRLEKAGELL RVKEPVDPVL EMAALADRAA KQGGPALLFE NVKGRAPGGF 
PVAMNLFGTR RRASWALSSE DFEEHARELR SLLHMAPPQS LWDKLKMLPK LGKLATMTPK
HVSSAASQEV VLREPDLGQL PVLTTWPHDG GPFVTLPQVI TRDPDTGIRN VGMYRLQVLG
PRRLAMHWQL HKTATAHYRG YRKRGEPMPV AIALGGDPAL TYCASAPLPP NVDEYLFAGF
LRGEAVRMTK GVAVDLDVPA DADLVIEGYV DTAAPLVREG PFGDHTGFYS LADDYPALDV
VAVTHRRGAI YPATVVGPPP VEDQWLGKAT ERIFLPMLQM IFPEIVDMAM PVEGVFHNLC
LISIKKEHPG QAKKVIHGLW GSGQMAQTKT LVVFDDDVDV QDVPQAAWRA FANVDVKRDL
VIADGPVDVL DHAATHFAFG GKIGVDATRK WREEGGREWP EVCVHPPEVI ARMDALYERL
VPGAERPRRP RIAPPPAASW KPPSGGILQ