Gene M446_1324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1324 
Symbol 
ID6134621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1457416 
End bp1458744 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content67% 
IMG OID641641603 
Productbenzoate 1,2-dioxygenase, large subunit 
Protein accessionYP_001768274 
Protein GI170739619 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0648485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGAAG ATCTGCATCA GATTGTCGAG ACCGCGGTTG AGGATAATCC CGAGACGGGC 
ACCTATCGCT GCCGGCGCGA CATCTTCACG GATCCCGACA TCTTCGAGTT GGAGATGCGG
CATATATTCG AGGGGAACTG GATCTACATG GCCCACGAGA GCCAGATCCC GAACCCGAAC
GATTATTTCA CGACCTATAT GGGCCGCCAG CCGGTGGTCA TCACCCGCAA CCGCAAGGGC
GAGCTCCAGG CCTTCGTCAA CGCGTGCAGC CATCGCGGCG CGATGATCTG CCGGCACAGG
CGCGGCAACA AGGCGACCTT CACCTGCCCG TTCCACGGCT GGACCTTCAG CAACGGCGGC
AGGCTGCTCA AGGTCAAGGA CCCGGAGGGC GCGGGCTATC CCGACAGCTT CAACCGCGAC
GGCTCGCACG ACCTGACGCC CGTGGCCCGG TTCGAGAGCT ACCGCGGTTT CCTGTTCGGC
AGCCTCAACC CGGACGTGAA GCCGCTCACC GAGCACCTCG GCGAGGCGAC GAGGATCATC
GACATGATCG TCGACCAGTC GCCGGACGGG CTCGAAGTGC TGCGCGGCGC CTCGACCTAC
GTCTTCGACG GCAACTGGAA GCTCCAGACC GAGAACGGCG CCGACGGCTA CCACGTCAGC
GCGACGCACT GGAACTACGC CGCCACGACG AGCCGCCGCA AGGAGACCCA CGTCGTCGAC
AAGACCCGCG CGATGGATGC GGGCGGCTGG GCGAAGCAGG GCGGCGGCTT CTACTCCTTC
GCGCACGGGC ACCTGCTGCT CTGGACCACC TGGGCCAACC CGGAGGACCG CCCGAACTGG
GACCGGCGCG AGGAGCTGGC GGCGGCTCAC GGGCAGGCGA TGGCGGACTG GATGATCAGC
CGCTCGCGCA ACCTCTGCCT CTACCCGAAC GTCTACCTGA TGGACCAGTT CTCGTCGCAG
ATCCGCACCT ACCGGCCGAT CGCGGTCGAT AAGACCGAGG TGACGATCTA CTGCATCGCC
CCGAAGGGCG AGGCCCCCGA GGCGCGGGCC CGGCGCATCC GCCAGTACGA GGATTTCTTC
AACGCGAGCG GCATGGCGAC CCCGGACGAC CTCGAGGAGT TCCGGGCCTG CCAGCTCGGC
TACCAGGGCC GGGCGGCGCG CTGGAACGAC CTCTGCCGGG GCGCCACCCA CTGGATCGAG
GGCGCCGACG AGGGCGCGCG GGCGATCGGC CTCGAGCCGC TGCTCAGCGG CGTCAGGACC
GAGGACGAGG GCCTGTTCGC GATCCAGCAC CGCTACTGGA TGGAGACGAT GCGCCGGCAC
CTGCCGTAG
 
Protein sequence
MLEDLHQIVE TAVEDNPETG TYRCRRDIFT DPDIFELEMR HIFEGNWIYM AHESQIPNPN 
DYFTTYMGRQ PVVITRNRKG ELQAFVNACS HRGAMICRHR RGNKATFTCP FHGWTFSNGG
RLLKVKDPEG AGYPDSFNRD GSHDLTPVAR FESYRGFLFG SLNPDVKPLT EHLGEATRII
DMIVDQSPDG LEVLRGASTY VFDGNWKLQT ENGADGYHVS ATHWNYAATT SRRKETHVVD
KTRAMDAGGW AKQGGGFYSF AHGHLLLWTT WANPEDRPNW DRREELAAAH GQAMADWMIS
RSRNLCLYPN VYLMDQFSSQ IRTYRPIAVD KTEVTIYCIA PKGEAPEARA RRIRQYEDFF
NASGMATPDD LEEFRACQLG YQGRAARWND LCRGATHWIE GADEGARAIG LEPLLSGVRT
EDEGLFAIQH RYWMETMRRH LP