Gene M446_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0053 
Symbol 
ID6130415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp60537 
End bp61919 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content76% 
IMG OID641640396 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001767075 
Protein GI170738420 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.071611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGACCC GGCCCCTCCC GCTCCTCGAC CCGATCGCCG CCGCGTCGCG CCTGCGGGGG 
CGCAGGGGGC TCGCCCTCCT CGACAGCGCG ATGCGCCACC CGGACCTCGG GCGCTACTCC
TACGTCGCGG CCGACCCGTT CCGCGAGATC CGGGTGCGGG ACGGGCGGAC CCTCCTCGAC
GGCGTACCCG CCGAGGGGCC GCCGCTCGCC GCAATCCGGC GGGCGCTCGC CCCCTACCGG
GCCGAGCGGA GGCCCGACCT TCCCCCGTTC CAGGGCGGGG CGATCGGCTA CCTCGCCTAC
GATTTCGGCC GGGCGCTGGA GCGCGTGGCG GCCCCGCCGC GCCGCGCCGG GCTCTGCGAC
GACCTCGCCC TCAACCTCTA CGCCACGATC CTCGCCTTCG ACCACGGCGA GGGGACCTGC
GCCCTCGTCG CCACCGGCTT CCCCGAGACC GAGCCCGACG CCCGCGCGCG CCGCGCCCGC
GCCGATCTCG ACGCCTGGGA GGCGGCCCTC GCGGCGCCCG AACCGGCCCC GGCGCGCCGC
GCCGTCCCGC CGCTCGCCTG GGAGTCGAAC TTCACGCGCG ACGGCTACGC GGCGGCGGTG
GAGCGGGTGC GGGACTACAT CCGGGCCGGC GACATCTACC AGGCGAACAT CGCGCAGCGC
TTCGCGGCCG CGCTGCCGCC GGACTTCGAC GCCTTCGCGC TCTACCGGCG GCTGCGGGCG
CGCAACGCCG CGACCTTCGC GGCCTATCTC GAACTCGGCG CGCTCACGGT GGCGTCGAGC
TCGCCCGAGC GCTTCCTGCG CCTCGACGGC CGCCGGATCG AGACGCGGCC GATCAAGGGC
ACGGCCCCGC GCGCGGCCGA CCCGGCCGAG GACCGCGCCC GGGCCGAGGC GCTGCTCGCC
AGCGACAAGG AGCGGGCCGA GAACGTGATG ATCGTCGACC TCCTGCGCAA CGACCTGTCG
CGGATCAGCG AGCCGCACAG CGTCGCCGTG CCGGTGCTGT GCGGGCTCGA AACCTACGCG
GGGGTGCACC ACCTCGTCTC GGTGGTGACC GGGCGGCTGC GCGCGGGCGC CGACGCGCTC
GACCTCCTGG CCGCCACCTT CCCGGGCGGC TCGATCACCG GCGCGCCCAA GCTGCGCGCC
ATGGACATCA TCACGGAGAT CGAGGGCGAT GCCCGCGAAT TGTTCTGCGG CAGCATCGGC
TGGATCGGGT TCGACGGCAG CCTCGACACC AACATCGCCA TCCGCACGGT GTTCATGGAG
GCGGGCCGCG CGGTGCTGCA GGCGGGCGGC GGGGTCACGC TGCTCTCCGA TCCGCTCGCC
GAGTACGAGG AGACGCTGAC CAAGGCGGAG CGGGTCTTCG CGGCCTTCCC GGAGGCGGGC
TGA
 
Protein sequence
MWTRPLPLLD PIAAASRLRG RRGLALLDSA MRHPDLGRYS YVAADPFREI RVRDGRTLLD 
GVPAEGPPLA AIRRALAPYR AERRPDLPPF QGGAIGYLAY DFGRALERVA APPRRAGLCD
DLALNLYATI LAFDHGEGTC ALVATGFPET EPDARARRAR ADLDAWEAAL AAPEPAPARR
AVPPLAWESN FTRDGYAAAV ERVRDYIRAG DIYQANIAQR FAAALPPDFD AFALYRRLRA
RNAATFAAYL ELGALTVASS SPERFLRLDG RRIETRPIKG TAPRAADPAE DRARAEALLA
SDKERAENVM IVDLLRNDLS RISEPHSVAV PVLCGLETYA GVHHLVSVVT GRLRAGADAL
DLLAATFPGG SITGAPKLRA MDIITEIEGD ARELFCGSIG WIGFDGSLDT NIAIRTVFME
AGRAVLQAGG GVTLLSDPLA EYEETLTKAE RVFAAFPEAG