Gene Pnap_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_0939 
Symbol 
ID4689708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp987194 
End bp988969 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content71% 
IMG OID639833937 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_981177 
Protein GI121603848 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTCATA TTTCCGCATG CATTGACTTT TCCAGCCCGC AGGGCGGCGC CGCGCCGCGC 
CTGCGCCACG TCTTTGGCAC GCCGCGCGCC GTGCTGGTCG CGCATGAACT GGCCGAGGTG
CGCGCGGTGC TCGACGCCGT GCAGGCCGCC GCCGAAAATG GAAGCTGGTG CGTCGGCTAC
CTGCGCTATG AAGCCGCCCC GGCATTTGAC GCCGCGCTGG CCGTGCATGC GCCCGATGGC
CCGCTGGCCT GGTTTGCCGT GCATGACGAC GCCTTGCCCT GGGTTGAGGA CGCCAGCGCC
GCAACGGCCC GGGTCCAATG GCTTGACACC TGCCCGCGCC CCGCGTTTGG CGCGGCCATG
GACCAGATCC AGCGCGCCAT CGCGGCTGGC GAGCTGTACC AGGTCAACTT CACCGCGCCG
CTTCTGGGCG AGTGGGCCGG CGAGCCCGAA GCAGGCGCGG CGCAAGCCCT GTTCGCCGCC
CTGCAGCGCG CCCAGCCGGG CGGCTACGCG GCCTTCATCG ACACCGGCAA CACGGGCGAC
GGCCAGTTGC TGTCGGTGTC GCCCGAGCTG TTTTTTGACT GGCAGGATGG CCAGATTCTG
GCGCGGCCCA TGAAGGGCAC GGCCGCGCGC GGCGCCACGC CCGAGATGGA TGCCGCCCAG
GCCGCCGCCC TGCGCGCATC GCCCAAGGAG CGCGCCGAAA ACGTCATGAT CGTCGATCTG
CTGCGCAACG ACCTCTCGCG CATCGCTGAG CCTTTCAGCG TGCAGGTGCC CGCCTTGTTC
CACACCGAAG CGCTGCCCAC GGTCTGGCAG ATGACCTCCG ACGTGCGCGC CCGCACCCGC
GCCGGCACCA CGCTGGCCGA TGTGTTCGCC GCGCTGTTTC CGTGCGGCTC GGTCACCGGC
GCGCCCAAGG TGCGCGCCAT GCAGATGATC CGCAAACTCG AAGCCGGGCC GCGCGGCGTG
TACTGCGGCG CCATCGGCGT GGTGCGCCCG GGCAAGGACG GCCATGGAAT CCGGGCCACC
TTCAACGTGC CGATCCGCAC CGTCAGCGTG CAGGCCGGCG GCCTGCGCTG CGGCATTGGC
AGCGGCATCA CCTCGGGCGC CGTGCCTGAC GCCGAGTGGC AGGAGTGGCG CAACAAACGA
CAATTTCTGG AGCGTGCCAG CATGCCTTTC GACCTTCTGG AAACCCTGGC GCTAAAGGAT
GGCCAGCTGC GCCATGCCGC CGAGCACCTG CAGCGCCTGG CCGGCGCCGC CGCGCATTTC
GGCATTCCAT GGGATGCCGC CGCCGTGCGG CATTGCCTGC ATGAACTGGC GCAAGCGCAC
CCGCAAGACC TGTGGCGCGT GCGCCTGCTG CTCGATGCCC GGGGCCGGGC ACGCGCCGAG
GCGTTCGCCA TGGACCCGTC GCCCGCCCAG GTGCGGCTGC GGCTGGCGGA GCGCCCGCTT
GAAGACGCGC ACGGCGAGTT CGTGCGCTTC AAGACCACGC GCCGCGCGCA TTACGACGCC
TTCACGCCCA CTGAATCCGG CGTGTTCGAC ACCGTGTTGT GGAATACCGA AGGCGAGATC
ACCGAATGCA CGCGCGGCAA CGTGGCCATG CTGCTGGACG GCCGCTGGGT CACGCCGCCG
CTGGCCTGCG GCCTGCTGCC CGGCGTCGGG CGCGCGCTGG CGCTGCGGGA AGGCCGGCTG
ACCGAGGCCG TGGTGCGGCT GGAAGACCTG CCCCGCGTGC AGGGCTGGGC GTTCGTGAAC
AGCCTGCGCG GATGGCTGGC AGCAGAGAAG GTTTGA
 
Protein sequence
MTHISACIDF SSPQGGAAPR LRHVFGTPRA VLVAHELAEV RAVLDAVQAA AENGSWCVGY 
LRYEAAPAFD AALAVHAPDG PLAWFAVHDD ALPWVEDASA ATARVQWLDT CPRPAFGAAM
DQIQRAIAAG ELYQVNFTAP LLGEWAGEPE AGAAQALFAA LQRAQPGGYA AFIDTGNTGD
GQLLSVSPEL FFDWQDGQIL ARPMKGTAAR GATPEMDAAQ AAALRASPKE RAENVMIVDL
LRNDLSRIAE PFSVQVPALF HTEALPTVWQ MTSDVRARTR AGTTLADVFA ALFPCGSVTG
APKVRAMQMI RKLEAGPRGV YCGAIGVVRP GKDGHGIRAT FNVPIRTVSV QAGGLRCGIG
SGITSGAVPD AEWQEWRNKR QFLERASMPF DLLETLALKD GQLRHAAEHL QRLAGAAAHF
GIPWDAAAVR HCLHELAQAH PQDLWRVRLL LDARGRARAE AFAMDPSPAQ VRLRLAERPL
EDAHGEFVRF KTTRRAHYDA FTPTESGVFD TVLWNTEGEI TECTRGNVAM LLDGRWVTPP
LACGLLPGVG RALALREGRL TEAVVRLEDL PRVQGWAFVN SLRGWLAAEK V