Gene Bpro_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1842 
Symbol 
ID4015513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp1903679 
End bp1904776 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content65% 
IMG OID637941511 
Productchorismate synthase 
Protein accessionYP_548673 
Protein GI91787721 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.473892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.107896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGCA GTACCTTTGG CAATCTCTTC GCAGTCACCA ACTTTGGTGA ATCCCACGGC 
CCGGCCATTG GCTGCGTGAT TGACGGCTGC CCGCCGGGGC TGGCGCTGAC CGAAGCGGAT
ATCCAGACCG ATCTGGACCG CCGCCGTCCG GGCACCAGCC GCCATGTGAC GCAGCGCAAC
GAACCCGACG CGGTGGAAAT CCTGTCGGGC GTGTACGAGG GCAAGACCAC CGGCACGCCG
ATCTGCCTGC TGATCAGGAA CACCGACCAG CGCAGCAAGG ACTACGGCAA CATCCTGGAA
ACCTTCCGGC CCGGCCATGC CGACTACAGC TACCTGCACA AATACGGGCG GCGTGACCCC
CGCGGCGGCG GCCGCGCCTC GGCCCGCCTG ACGGCGCCCA TGGTGGCCGC CGGCGCGGTG
GCCAAAAAAT GGCTGGCTGA GAAATATGGC ACCAGCTTTC GCGGCTGCAT GGCGCAGATT
GGCGACATCG CGATTCCCTT TGAGTCCTGG GAGCATGTGC CGCGCAATCC CTTCTTTGCG
CCGGTGGCCG ACGTTTCCCA CCTTGAAGAC TACATGGATG CACTGCGCAA GGCCGGTGAC
TCCTGCGGCG CGCGCATCCG GGTCACCGCT TCCGGTGTGC CCGTCGGGCT GGGCGAGCCG
CTGTTTGACA AGCTCGATGC CGACATCGCA TTTGCCATGA TGGGGATCAA TGCCGTCAAG
GGCGTGGAGA TCGGCGCCGG CTTTGCCAGC GTGACCCAGC GCGGAACAAC CCATGGCGAC
TCACTGTCGC CCGAAGGTTT CCTTTCGAAC AATGCCGGTG GTGTGCTCGG CGGCATCAGC
ACTGGGCAGG ACCTGGAAGT CTCGATCGCC ATCAAGCCCA CGAGCTCCAT CATCACACCG
CGCCAGTCGA TAGACACGGC GGGCAACCCC GCCGAGGTGG TGACCAAGGG CCGGCACGAC
CCCTGCGTGG GCATTCGCGC CACGCCGATT GCCGAGGCCA TGCTGGCGCT CGTCGTGATG
GAGCATGCGC TGCGCCAGCG TGCGCAAAAT GCCGATGTGA CGGTCAGCAC GCCGGACATC
ATGCGCGCAC GCGGCTGA
 
Protein sequence
MSGSTFGNLF AVTNFGESHG PAIGCVIDGC PPGLALTEAD IQTDLDRRRP GTSRHVTQRN 
EPDAVEILSG VYEGKTTGTP ICLLIRNTDQ RSKDYGNILE TFRPGHADYS YLHKYGRRDP
RGGGRASARL TAPMVAAGAV AKKWLAEKYG TSFRGCMAQI GDIAIPFESW EHVPRNPFFA
PVADVSHLED YMDALRKAGD SCGARIRVTA SGVPVGLGEP LFDKLDADIA FAMMGINAVK
GVEIGAGFAS VTQRGTTHGD SLSPEGFLSN NAGGVLGGIS TGQDLEVSIA IKPTSSIITP
RQSIDTAGNP AEVVTKGRHD PCVGIRATPI AEAMLALVVM EHALRQRAQN ADVTVSTPDI
MRARG