Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0523 |
Symbol | pabB |
ID | 2685977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 557100 |
End bp | 558890 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637125189 |
Product | para-aminobenzoate synthase, component I |
Protein accession | NP_951581 |
Protein GI | 39995630 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.120338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCG CGCCCACGGT CATTCTCGCC TCTTTCGATG CCGAGCGGCA TTCGGCCTCG TACCGGTTCG AGGAGTTTGT GGAAGCCGTG ACGGCCCTGA CCCCTGCCGA GGTCGTGCCG GCCCTGCGCC GGGTGGAAGC GGCGGTGGCC GGCGGTCTCC ACGCGGCGGG ATTCGTCAGC TATGAGGCGG CGCCCGGCCT GGACGAAACC CTGACAACCC GCGAACCGGT GCCGGACACC CCGCTGGTCT GGTTCGGCCT GTTCCGCCGC CGCATCGGCT TTGCGCCCCG GCTCCCCGAA TGCGAGCAGG ACGTGCCACC CGGCTACGAG ACCAGCCAGT GGAGCGCCAC GCTCCAGCGG GAGCCCTACC TGGAGTCAGT CGGTCGGATC AGGCAGTACA TAACGGCCGG CGACTGCTAT CAGGTCAACT TCACCTTCCG CCAGCAGTTC CGCTTCACGG GCGATCCCCA GGCATGGTTC CACGATCTCT GCCGGGCCCA GAGAGCCCCT TTCTGTGCCT TCATCGATAC GGGATCGCTC CGGGTCCTCT CCACCTCGCC CGAACTGTTC TTCGACCTGC GCCAGGGGAC CCTCACCTGC CGCCCCATGA AGGGAACCGC CCGCAGGGGA CGCTGGCGGG CCGAGGACGA GGAGTTACGC GCGGGACTTG CCGCCAGCGA GAAGGAGCGG GCCGAAAACC TGATGATCGT CGACCTGCTG CGCAACGACA TGGGAATGGT GGCTGAAACG GGCTCGGTGC GGGTGGAGTC GCTCTTTGAC GTGGAAAGCC TCGAAACGCT CCACCAGATG ACCTCCACCA TCACGGCCCG GCCGCAGGCC GGGGTCGGCC TCGCCGATCT CTTCCGGGCG CTCTTCCCCT GCGGGTCGGT GACCGGTGCG CCCAAGCGGC GGAGCATGGA GATAATCCGG GAGCTGGAGG ATTCGCCCCG GGGGATCTAC ACCGGCGCCA TCGGCTACGT CTCCCCGGCG GCGCAGGGGG CACCCGCCCC CTTTGAGGCG ACCTTCAGTG TCGCCATCAG GACAGTGGTC CTGGACGCCG CATCGGGGCA GGGGCAGTTG GGCATCGGCA GCGGTGTGAC CATCGGCTCG ACCCCTTCGT CGGAGTATGA CGAGTGCCTC GCCAAGAGCA GATTCGCCCG GGAGCGTGTC CCCGACTTCC AGTTGGTGGA GACGCTGCTC CACGAGGAAG GAGCGGGATT TTTCCTGCTG GAGCGCCATC TGGCGCGACT CTACCGGTCA GCCGCCCATT TCGGGATTCC GCTCCGGCTC GGCAGCCTCC AGGAGATCCT CAACCGACGG GCCGCCCTGA TGGAGGGTCG GCAAAAGGTG CGCGTACTGG TGAACCGGCG GGGGGCGTTC ACCATCCAGG AAGCACCGCT GACCGAAGCG CCCTGCCCGG AACCGATTCC CGTCCGCTTT GCGGCCACGT CAGTGGACCC GGCCGATCAG TTCCTCTACC ACAAGACCAC CTACCGCCCC CTCTACCGGC ACGAACTGGC GGCGGCGCCC GACTGCGCAG ACGTCATCTT CGTAAACCGG CACGGTGAAG TGACCGAGGG AACCACGGCC AATGTGGCCG CCCGCATCGA CGGGGAAATG GTCACCCCTC CCCTTGCCGC CGGCATCCTC CCCGGCACCT TCCGGGAAGA GCTCCTGGCC GAGGGCGCCC TCCGCGAACG GCCCATCACG CGGGAGGAAC TGGAACGGTG CCCGGAGATC TACCTCATCA ACTCGGTCCG CCGGTGGCGG CCGGTGACTC TCATCACCTG A
|
Protein sequence | MSGAPTVILA SFDAERHSAS YRFEEFVEAV TALTPAEVVP ALRRVEAAVA GGLHAAGFVS YEAAPGLDET LTTREPVPDT PLVWFGLFRR RIGFAPRLPE CEQDVPPGYE TSQWSATLQR EPYLESVGRI RQYITAGDCY QVNFTFRQQF RFTGDPQAWF HDLCRAQRAP FCAFIDTGSL RVLSTSPELF FDLRQGTLTC RPMKGTARRG RWRAEDEELR AGLAASEKER AENLMIVDLL RNDMGMVAET GSVRVESLFD VESLETLHQM TSTITARPQA GVGLADLFRA LFPCGSVTGA PKRRSMEIIR ELEDSPRGIY TGAIGYVSPA AQGAPAPFEA TFSVAIRTVV LDAASGQGQL GIGSGVTIGS TPSSEYDECL AKSRFARERV PDFQLVETLL HEEGAGFFLL ERHLARLYRS AAHFGIPLRL GSLQEILNRR AALMEGRQKV RVLVNRRGAF TIQEAPLTEA PCPEPIPVRF AATSVDPADQ FLYHKTTYRP LYRHELAAAP DCADVIFVNR HGEVTEGTTA NVAARIDGEM VTPPLAAGIL PGTFREELLA EGALRERPIT REELERCPEI YLINSVRRWR PVTLIT
|
| |