Gene Pnap_3654 details


Gene Information

Locus tag: Pnap_3654
Symbol:
ID: 4686190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3888549
End bp: 3890048
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 64%
IMG OID: 639836672
Product: anthranilate synthase component I
Protein accession: YP_983871
Protein GI: 121606542
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
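
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_bp // 3 - 1


length = gene_length_bp(3888549, 3890048)
print(length, protein_length_aa(length))
```

Under those assumptions the values agree with the record: 1500 bp and 499 aa.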


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.915059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTACCG AACTCGAATT TAAAAGCCTC GCGGCGCAAG GCTACAACCG CATTCCGCTG 
ATGCTCGAAG CATTCGCTGA CCTGGAAACT CCGCTGTCGC TCTACCTCAA GCTGGCCAAC
GCCAGGGACG GCGGAAAATT CAGCTTTTTG CTCGAATCGG TCGTCGGCGG CGAGCGCTTC
GGCCGCTACA GCTTCATCGG CTTGCCGGCC AGAACGCTGC TGCGCTCGTC GGGCTTTGGG
GATGACGCGC TGACCGAGGT CGTGACCGAT GGCCAGGTGG TTGAAACCTC CCGCGTCAAT
CCGCTGGACT TCATCAGCGA CTACCAGAAG CGCTTCAAGG TCGCGCTGCG GCCCGGACTG
CCGCGTTTTT GCGGCGGGCT GGCGGGCTAC TTCGGCTACG ACGCGGTGCG CTACATCGAG
AAAAAGCTGG AAAAAAGCTG CCCGCCCGAC ACCCTGGGCT GCCCCGACAT CCTGCTGCTG
CAGTGCGAGG AACTGGCGGT CATCGACAAC CTGTCGGGCA AGCTCTACCT GATCGTCTAT
GCCGACCCGG CCCGCCCGGA AGCCTATGCC AACGCCAAGA AGCGCCTGCG CGAACTCAAG
GAGCAGCTCA AGTATTCGGT CAGCGCGCCC AGCGTCAAAC CAAGCCAAGG CTACCCGGCC
GAGCGCGAAT TCGCCAAGGC TGACTACATC GCCGCCGTCG AGCGCGCCAA AAAGCTGATC
GAAGGCGGCG ACTTCATGCA GGTGCAGGTC GGCCAGCGCA TCAAGAAGCG CTACACCGAG
TCGCCGCTGT CGCTGTACCG CGCGCTGCGC TCGCTCAATC CGTCGCCCTA CATGTATTAC
TACCATTTCG GCGATTTCCA TGTGGTCGGC GCTTCGCCCG AAATCCTGGT GCGCCAGGAG
CAGGTGGAAG CGGGGCAAAA GATCACCATC CGCCCGCTGG CCGGCACCCG GCCGCGCGCC
TCGTCGCTGG AAGCCGACAA GGCCGCCGAG CACGAACTCA TCAACGACCC GAAGGAGCGC
GCCGAGCACG TCATGCTGAT CGACCTGGCG CGCAACGACA TCGGCCGCAT CGCCCAGACC
GGCACGGTGA AGGTGACCGA AGCCTTTGCC GTCGAGCGCT ACAGCCATGT GATGCACATC
GTCAGCAATG TCGAAGGCAT CCTGAACGAC GGCATGACCA GCATGGACGT GCTGCGGGCG
ACTTTTCCGG CCGGCACGCT GACCGGCGCG CCGAAAGTGC ATGCGATGGA GCTGATCGAC
CAGTTGGAGC CGACCAAGCG CGGCCTGTAC GGCGGCGCCT GCGGCTACCT GAGCTATGCC
GGCGACATGG ACGTGGCGAT TGCGATTCGC ACCGGCATCA TCAAGGACCA GACGCTGTAT
GTGCAGGCGG CGGCCGGCGT GGTGGCCGAC TCGGTGCCCG AACTGGAATG GAAAGAAACC
GAAGCCAAGG CGCGCGCGCT GCTGCGCGCC AGCGAACTGG TTGAAGAAGG ACTGGAGTAA
 
Protein sequence
MITELEFKSL AAQGYNRIPL MLEAFADLET PLSLYLKLAN ARDGGKFSFL LESVVGGERF 
GRYSFIGLPA RTLLRSSGFG DDALTEVVTD GQVVETSRVN PLDFISDYQK RFKVALRPGL
PRFCGGLAGY FGYDAVRYIE KKLEKSCPPD TLGCPDILLL QCEELAVIDN LSGKLYLIVY
ADPARPEAYA NAKKRLRELK EQLKYSVSAP SVKPSQGYPA EREFAKADYI AAVERAKKLI
EGGDFMQVQV GQRIKKRYTE SPLSLYRALR SLNPSPYMYY YHFGDFHVVG ASPEILVRQE
QVEAGQKITI RPLAGTRPRA SSLEADKAAE HELINDPKER AEHVMLIDLA RNDIGRIAQT
GTVKVTEAFA VERYSHVMHI VSNVEGILND GMTSMDVLRA TFPAGTLTGA PKVHAMELID
QLEPTKRGLY GGACGYLSYA GDMDVAIAIR TGIIKDQTLY VQAAAGVVAD SVPELEWKET
EAKARALLRA SELVEEGLE
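
The sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The sketch below strips the whitespace, computes GC content, and translates a fragment with the codon assignments of translation table 11. It rests on stated assumptions: the helper names are hypothetical, and only ATG, GTG, and TTG are treated as initiation codons that translate to M at the first position.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a simplified set of initiation codons read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    residues = []
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First thirty bases of the gene sequence listed above.
fragment = clean_sequence("GTGATTACCG AACTCGAATT TAAAAGCCTC")
print(translate(fragment))
```

The translated fragment corresponds to the first ten residues of the protein sequence listed above, with the GTG start codon read as methionine.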