Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3907 |
Symbol | |
ID | 5834790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4342720 |
End bp | 4344138 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641369698 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_001641349 |
Protein GI | 163853306 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.177197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.165975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG CACGAGGCAT GACGGCCGAG CCCATGGTCT GGACCCGCGA GATTCCCTTC ATCGACCCGG TCGCGGCCGC GGCGCGGCTC GCCCGGTTGC CCGGCCTCGC CTTCCTCGAC AGCGCGATGC GCCACGATAC GCTCGGCCGC GTCTCGGTGC TGGCCGCCGA CCCGTTCGCG CGGTTCCGCT ACAGCGATGG TCGTGCCACC CTGGACGGGC GCGCGGTGCC CGGCTCGCCC ATCGCGGCGT TGCGGGCCTG CCTCGCGCCC TACCGTCTGG CGCCCCGGCC CGACCTGCCC GCCATTCCGG GAGCGATCGG CTATTTCGCC TACGATCTCG GTGCGAGCCT GGAGCGGGTC GCGGCCCCGG CGCGCCGGGC GGGGCTCACC GATGACATCG CCTTCAACCT CTACGACACC CTGCTCGCCG TCGATCACGG CCGCGGCACC TGCCTGCTGA TCGCCACCGG CTTTCCCGAA GCCGACGGAC CGGCCCGCGC GGCACGGGCG CAGGCGCGGC TCGATGCCTT CGCCGATTGG CTCGCCGCCC CGTCCGAACC GCTGCCGAAA TGGACGGGCG CCCGGCTCAC ATGGCGCTCA AATTTTTCGC GACAAACCTA TGAAGCGGCT GTCGAAAAGG TCCGGAACTA CATCCGCGCC GGCGACATCT ATCAGGCCAA CATCGCCCAG CGTTTCGCCG CCGACCTGCC GCCCGGTTTC GACCCGTTCG CCTTCTACCG GCGGCTTCGC GAGACCAACC CGGCGACCTT CGGCGCCTAT CTCGATTTCG ACGGGCTCAC CGTCGCCTCC TCCTCGCCCG AGCGCTTCCT CAAGTTGGAG GGGCGGGCGG TCGAGACGCG ACCGATCAAG GGCACCGTGG CCCGCGATCC CGATCCCGCC CGCGATGCCG AGATCGCCGC CGCGCTCCAG GCCAATCCGA AGGAGCGGGC CGAGAACATC ATGATCGTGG ACCTGCTGCG CAACGACCTG TCGCGGGTGT GCGAGCCGGG CAGCGTGCGG GTGCCGACCC TGTGCGGGCT GGAATCCTAT GCCGGCATCC ACCATCTCGT CTCGGTGGTG ACGGGTACGC TCCGCGAGGG TTCGGATGCG CTCGATCTCA TCCAAAAAAC CTTTCCCGGG GGCTCGATCA CCGGCGCGCC GAAGCTCAGG GCCATGGATA TCATCACCGA GATCGAGACG GATGCGCGCG AGCTCTATTG CGGGGCGATC GGCGCGCTCG GCTTCGACGG ATCGCTCGAC ACCTCGATCG CGATCCGCAC CGTGTTCATG GCGAAGGGAC AGGCCGTGCT CCAGGCGGGC GGGGGCGTGA CGCTGCTCTC CGAGCCCGGC CCCGAATACG AGGAGACGCT GACCAAGGCG GCCCGCGTCT TCGCGGCCTT CGAGGAGGAG GCGCCATGA
|
Protein sequence | MSAARGMTAE PMVWTREIPF IDPVAAAARL ARLPGLAFLD SAMRHDTLGR VSVLAADPFA RFRYSDGRAT LDGRAVPGSP IAALRACLAP YRLAPRPDLP AIPGAIGYFA YDLGASLERV AAPARRAGLT DDIAFNLYDT LLAVDHGRGT CLLIATGFPE ADGPARAARA QARLDAFADW LAAPSEPLPK WTGARLTWRS NFSRQTYEAA VEKVRNYIRA GDIYQANIAQ RFAADLPPGF DPFAFYRRLR ETNPATFGAY LDFDGLTVAS SSPERFLKLE GRAVETRPIK GTVARDPDPA RDAEIAAALQ ANPKERAENI MIVDLLRNDL SRVCEPGSVR VPTLCGLESY AGIHHLVSVV TGTLREGSDA LDLIQKTFPG GSITGAPKLR AMDIITEIET DARELYCGAI GALGFDGSLD TSIAIRTVFM AKGQAVLQAG GGVTLLSEPG PEYEETLTKA ARVFAAFEEE AP
|
| |