Gene Mext_3907 details


Gene Information

Locus tag: Mext_3907
Symbol:
ID: 5834790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4342720
End bp: 4344138
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 70%
IMG OID: 641369698
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_001641349
Protein GI: 163853306
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
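
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 4342720, End bp 4344138.
length = gene_length_bp(4342720, 4344138)
print(length, protein_length_aa(length))  # prints "1419 472"
```

Under these assumptions the result agrees with the listed gene length of 1419 bp and protein length of 472 aa.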


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.177197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.165975
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCG CACGAGGCAT GACGGCCGAG CCCATGGTCT GGACCCGCGA GATTCCCTTC 
ATCGACCCGG TCGCGGCCGC GGCGCGGCTC GCCCGGTTGC CCGGCCTCGC CTTCCTCGAC
AGCGCGATGC GCCACGATAC GCTCGGCCGC GTCTCGGTGC TGGCCGCCGA CCCGTTCGCG
CGGTTCCGCT ACAGCGATGG TCGTGCCACC CTGGACGGGC GCGCGGTGCC CGGCTCGCCC
ATCGCGGCGT TGCGGGCCTG CCTCGCGCCC TACCGTCTGG CGCCCCGGCC CGACCTGCCC
GCCATTCCGG GAGCGATCGG CTATTTCGCC TACGATCTCG GTGCGAGCCT GGAGCGGGTC
GCGGCCCCGG CGCGCCGGGC GGGGCTCACC GATGACATCG CCTTCAACCT CTACGACACC
CTGCTCGCCG TCGATCACGG CCGCGGCACC TGCCTGCTGA TCGCCACCGG CTTTCCCGAA
GCCGACGGAC CGGCCCGCGC GGCACGGGCG CAGGCGCGGC TCGATGCCTT CGCCGATTGG
CTCGCCGCCC CGTCCGAACC GCTGCCGAAA TGGACGGGCG CCCGGCTCAC ATGGCGCTCA
AATTTTTCGC GACAAACCTA TGAAGCGGCT GTCGAAAAGG TCCGGAACTA CATCCGCGCC
GGCGACATCT ATCAGGCCAA CATCGCCCAG CGTTTCGCCG CCGACCTGCC GCCCGGTTTC
GACCCGTTCG CCTTCTACCG GCGGCTTCGC GAGACCAACC CGGCGACCTT CGGCGCCTAT
CTCGATTTCG ACGGGCTCAC CGTCGCCTCC TCCTCGCCCG AGCGCTTCCT CAAGTTGGAG
GGGCGGGCGG TCGAGACGCG ACCGATCAAG GGCACCGTGG CCCGCGATCC CGATCCCGCC
CGCGATGCCG AGATCGCCGC CGCGCTCCAG GCCAATCCGA AGGAGCGGGC CGAGAACATC
ATGATCGTGG ACCTGCTGCG CAACGACCTG TCGCGGGTGT GCGAGCCGGG CAGCGTGCGG
GTGCCGACCC TGTGCGGGCT GGAATCCTAT GCCGGCATCC ACCATCTCGT CTCGGTGGTG
ACGGGTACGC TCCGCGAGGG TTCGGATGCG CTCGATCTCA TCCAAAAAAC CTTTCCCGGG
GGCTCGATCA CCGGCGCGCC GAAGCTCAGG GCCATGGATA TCATCACCGA GATCGAGACG
GATGCGCGCG AGCTCTATTG CGGGGCGATC GGCGCGCTCG GCTTCGACGG ATCGCTCGAC
ACCTCGATCG CGATCCGCAC CGTGTTCATG GCGAAGGGAC AGGCCGTGCT CCAGGCGGGC
GGGGGCGTGA CGCTGCTCTC CGAGCCCGGC CCCGAATACG AGGAGACGCT GACCAAGGCG
GCCCGCGTCT TCGCGGCCTT CGAGGAGGAG GCGCCATGA
 
Protein sequence
MSAARGMTAE PMVWTREIPF IDPVAAAARL ARLPGLAFLD SAMRHDTLGR VSVLAADPFA 
RFRYSDGRAT LDGRAVPGSP IAALRACLAP YRLAPRPDLP AIPGAIGYFA YDLGASLERV
AAPARRAGLT DDIAFNLYDT LLAVDHGRGT CLLIATGFPE ADGPARAARA QARLDAFADW
LAAPSEPLPK WTGARLTWRS NFSRQTYEAA VEKVRNYIRA GDIYQANIAQ RFAADLPPGF
DPFAFYRRLR ETNPATFGAY LDFDGLTVAS SSPERFLKLE GRAVETRPIK GTVARDPDPA
RDAEIAAALQ ANPKERAENI MIVDLLRNDL SRVCEPGSVR VPTLCGLESY AGIHHLVSVV
TGTLREGSDA LDLIQKTFPG GSITGAPKLR AMDIITEIET DARELYCGAI GALGFDGSLD
TSIAIRTVFM AKGQAVLQAG GGVTLLSEPG PEYEETLTKA ARVFAAFEEE AP
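
The sequences above are printed in space-separated groups of ten, so they need the grouping whitespace removed before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and the example input is only the first two groups of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two groups of the gene sequence, used here only as a short example.
example = "ATGAGCGCCG CACGAGGCAT"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Pasting the full gene sequence block into `clean_sequence` should give a length that can be compared with the listed gene length, and `gc_fraction` gives a value to compare with the listed GC content.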