Gene Pnap_2745 details


Gene Information

Locus tag: Pnap_2745
Symbol:
ID: 4688496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 2889188
End bp: 2890285
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 66%
IMG OID: 639835753
Product: chorismate synthase
Protein accession: YP_982968
Protein GI: 121605639
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
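
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the record above.
START_BP = 2889188
END_BP = 2890285
GENE_LENGTH_BP = 1098
PROTEIN_LENGTH_AA = 365


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record.
print(computed_length == GENE_LENGTH_BP)       # True
print(computed_protein == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 2890285 - 2889188 + 1 gives 1098 bp, and 1098 / 3 - 1 gives 365 aa, matching the listed values.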


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0513827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCA ATACCTTTGG AAATCTCTTC GCAGTCACCA ACTTTGGTGA ATCCCACGGT 
CCGGCCATTG GCTGCGTGAT TGACGGCTGC CCGCCGGGCA TGGCGCTGTC AGAGGCCGAT
ATCCAGGGCG ACCTGGACCG GCGCCGGCCG GGCACCAGCC GCCACGTCAC GCAGCGCAAC
GAGCCCGACG CGGTCGAAAT CCTGTCCGGG GTCTATGAAG GCAAGACCAC CGGCACGCCG
ATCTGCCTCT TGATCAAGAA CACCGACCAG CGCAGCAAGG ACTACGGCAA CATCCTCGAT
ACCTTCCGGC CCGGCCATGC CGACTACACC TATCTGCACA AGTACGGCCT GCGCGACCCG
CGCGGCGGCG GCAGGTCGTC GGCCCGGCTG ACGGCACCCA TGGTGGCGGC CGGCGCGGTG
GCCAAGAAGT GGCTGTTTGA GAAATACGGC ACGACGTTTC GCGGCTGCAT GGCGCAGATC
GGCGAGGCCA TGATTCCGTT CGAGTCCTGG GAGCATGTGG CCAACAACCC ATTTTTCGCC
CCGGTGGCCG ACGTGTCAAA CCTTGAAAAC TACATGGACG CGCTGCGCAA GGCGGGCGAC
TCGTGCGGCG CGCGCATCCG CGTGGTAGCC TCTGGCGTGC CGGTCGGCCT GGGCGAGCCG
CTGTTCGACA AGCTGGATGC CGACATCGCC TTTGCCATGA TGGGCATCAA CGCCGTCAAG
GGCGTGGAAA TCGGCGCCGG CTTTGCCAGC GTGACGCAGC GCGGCACCAC GCACGGCGAT
TCGCTGTCGC CCGAAGGTTT CATGTCGAAC AATGCCGGCG GCGTGCTTGG CGGCATCAGC
ACCGGGCAGG ACCTGGAAGT GTCGATTGCC ATCAAGCCGA CCAGCTCCAT CATCACGCCG
CGCCAGTCGA TTGACACGGC CGGCAACCCG GCCGAGGTGG TGACCAAGGG CCGGCACGAC
CCCTGCGTCG GCATTCGCGC CACGCCGATT GCCGAAGCCA TGCTGGCGCT GGTCGTCATG
GAACACGCCC TGCGCCAGCG CGCGCAATGC GGCGATGTGA AGGTCAGCAC GCCGGACATC
ATGCGCAGCC GGGGCTGA
 
Protein sequence
MSGNTFGNLF AVTNFGESHG PAIGCVIDGC PPGMALSEAD IQGDLDRRRP GTSRHVTQRN 
EPDAVEILSG VYEGKTTGTP ICLLIKNTDQ RSKDYGNILD TFRPGHADYT YLHKYGLRDP
RGGGRSSARL TAPMVAAGAV AKKWLFEKYG TTFRGCMAQI GEAMIPFESW EHVANNPFFA
PVADVSNLEN YMDALRKAGD SCGARIRVVA SGVPVGLGEP LFDKLDADIA FAMMGINAVK
GVEIGAGFAS VTQRGTTHGD SLSPEGFMSN NAGGVLGGIS TGQDLEVSIA IKPTSSIITP
RQSIDTAGNP AEVVTKGRHD PCVGIRATPI AEAMLALVVM EHALRQRAQC GDVKVSTPDI
MRSRG
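
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, of how the blocks could be cleaned, the GC content computed, and the gene translated for comparison with the listed protein. The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical. The codon assignments are those shared by NCBI translation tables 1 and 11; the sketch does not handle alternative start codons, which is sufficient here only because this gene begins with ATG.

```python
# Codon assignments in NCBI order (bases T, C, A, G), shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence listed above.
gene_start = clean_sequence("ATGAGCGGCA ATACCTTTGG AAATCTCTTC")
gene_end = clean_sequence("ATGCGCAGCC GGGGCTGA")

print(translate(gene_start))  # MSGNTFGNLF
print(translate(gene_end))    # MRSRG
```

The two fragments translate to the first ten and last five residues of the listed protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and complete protein given in the record.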