Gene GSU0523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0523 
SymbolpabB 
ID2685977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp557100 
End bp558890 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content68% 
IMG OID637125189 
Productpara-aminobenzoate synthase, component I 
Protein accessionNP_951581 
Protein GI39995630 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.120338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCG CGCCCACGGT CATTCTCGCC TCTTTCGATG CCGAGCGGCA TTCGGCCTCG 
TACCGGTTCG AGGAGTTTGT GGAAGCCGTG ACGGCCCTGA CCCCTGCCGA GGTCGTGCCG
GCCCTGCGCC GGGTGGAAGC GGCGGTGGCC GGCGGTCTCC ACGCGGCGGG ATTCGTCAGC
TATGAGGCGG CGCCCGGCCT GGACGAAACC CTGACAACCC GCGAACCGGT GCCGGACACC
CCGCTGGTCT GGTTCGGCCT GTTCCGCCGC CGCATCGGCT TTGCGCCCCG GCTCCCCGAA
TGCGAGCAGG ACGTGCCACC CGGCTACGAG ACCAGCCAGT GGAGCGCCAC GCTCCAGCGG
GAGCCCTACC TGGAGTCAGT CGGTCGGATC AGGCAGTACA TAACGGCCGG CGACTGCTAT
CAGGTCAACT TCACCTTCCG CCAGCAGTTC CGCTTCACGG GCGATCCCCA GGCATGGTTC
CACGATCTCT GCCGGGCCCA GAGAGCCCCT TTCTGTGCCT TCATCGATAC GGGATCGCTC
CGGGTCCTCT CCACCTCGCC CGAACTGTTC TTCGACCTGC GCCAGGGGAC CCTCACCTGC
CGCCCCATGA AGGGAACCGC CCGCAGGGGA CGCTGGCGGG CCGAGGACGA GGAGTTACGC
GCGGGACTTG CCGCCAGCGA GAAGGAGCGG GCCGAAAACC TGATGATCGT CGACCTGCTG
CGCAACGACA TGGGAATGGT GGCTGAAACG GGCTCGGTGC GGGTGGAGTC GCTCTTTGAC
GTGGAAAGCC TCGAAACGCT CCACCAGATG ACCTCCACCA TCACGGCCCG GCCGCAGGCC
GGGGTCGGCC TCGCCGATCT CTTCCGGGCG CTCTTCCCCT GCGGGTCGGT GACCGGTGCG
CCCAAGCGGC GGAGCATGGA GATAATCCGG GAGCTGGAGG ATTCGCCCCG GGGGATCTAC
ACCGGCGCCA TCGGCTACGT CTCCCCGGCG GCGCAGGGGG CACCCGCCCC CTTTGAGGCG
ACCTTCAGTG TCGCCATCAG GACAGTGGTC CTGGACGCCG CATCGGGGCA GGGGCAGTTG
GGCATCGGCA GCGGTGTGAC CATCGGCTCG ACCCCTTCGT CGGAGTATGA CGAGTGCCTC
GCCAAGAGCA GATTCGCCCG GGAGCGTGTC CCCGACTTCC AGTTGGTGGA GACGCTGCTC
CACGAGGAAG GAGCGGGATT TTTCCTGCTG GAGCGCCATC TGGCGCGACT CTACCGGTCA
GCCGCCCATT TCGGGATTCC GCTCCGGCTC GGCAGCCTCC AGGAGATCCT CAACCGACGG
GCCGCCCTGA TGGAGGGTCG GCAAAAGGTG CGCGTACTGG TGAACCGGCG GGGGGCGTTC
ACCATCCAGG AAGCACCGCT GACCGAAGCG CCCTGCCCGG AACCGATTCC CGTCCGCTTT
GCGGCCACGT CAGTGGACCC GGCCGATCAG TTCCTCTACC ACAAGACCAC CTACCGCCCC
CTCTACCGGC ACGAACTGGC GGCGGCGCCC GACTGCGCAG ACGTCATCTT CGTAAACCGG
CACGGTGAAG TGACCGAGGG AACCACGGCC AATGTGGCCG CCCGCATCGA CGGGGAAATG
GTCACCCCTC CCCTTGCCGC CGGCATCCTC CCCGGCACCT TCCGGGAAGA GCTCCTGGCC
GAGGGCGCCC TCCGCGAACG GCCCATCACG CGGGAGGAAC TGGAACGGTG CCCGGAGATC
TACCTCATCA ACTCGGTCCG CCGGTGGCGG CCGGTGACTC TCATCACCTG A
 
Protein sequence
MSGAPTVILA SFDAERHSAS YRFEEFVEAV TALTPAEVVP ALRRVEAAVA GGLHAAGFVS 
YEAAPGLDET LTTREPVPDT PLVWFGLFRR RIGFAPRLPE CEQDVPPGYE TSQWSATLQR
EPYLESVGRI RQYITAGDCY QVNFTFRQQF RFTGDPQAWF HDLCRAQRAP FCAFIDTGSL
RVLSTSPELF FDLRQGTLTC RPMKGTARRG RWRAEDEELR AGLAASEKER AENLMIVDLL
RNDMGMVAET GSVRVESLFD VESLETLHQM TSTITARPQA GVGLADLFRA LFPCGSVTGA
PKRRSMEIIR ELEDSPRGIY TGAIGYVSPA AQGAPAPFEA TFSVAIRTVV LDAASGQGQL
GIGSGVTIGS TPSSEYDECL AKSRFARERV PDFQLVETLL HEEGAGFFLL ERHLARLYRS
AAHFGIPLRL GSLQEILNRR AALMEGRQKV RVLVNRRGAF TIQEAPLTEA PCPEPIPVRF
AATSVDPADQ FLYHKTTYRP LYRHELAAAP DCADVIFVNR HGEVTEGTTA NVAARIDGEM
VTPPLAAGIL PGTFREELLA EGALRERPIT REELERCPEI YLINSVRRWR PVTLIT