Gene Plav_0785 details

Gene Information

Locus tag: Plav_0785
Symbol:
ID: 5456810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 850882
End bp: 851979
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 63%
IMG OID: 640876354
Product: chorismate synthase
Protein accession: YP_001412065
Protein GI: 154251241
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
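
The coordinate and length fields in this record agree with one another, and a short sketch can make that check explicit. It assumes that the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(850882, 851979)
print(length_bp, protein_length(length_bp))  # 1098 365
```

Under those assumptions the result matches the reported gene length of 1098 bp and protein length of 365 aa.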


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.229169
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCACA ATACGTTCGG CCATCTGTTC CGCGTCACGA CCTGGGGCGA AAGCCACGGG 
CCGGCGCTCG GCTGCGTTGT GGACGGTGCG CCGCCGCGGC TGCCTTTGAA GGCCGAAGAT
ATCCAGCAAT GGCTCGACCG GCGCAAACCC GGCCAGTCGC GGTTCACCAC GCAGCGGCGC
GAGCCCGATG CGGTGAAAAT TCTCTCGGGC ACCTTCGTCG AAGACGGCAT AGAGATGACG
ACGGGCACGC CGATCTCGCT GATGATCGAG AATGTCGATC AGCGGTCGAA GGACTATGGC
GATATCGTCG AGAAGTTCAG ACCGGGCCAT GCCGATCTCA CCTATTTCCT GAAATATGGC
ATTCGCGATT ATCGCGGGGG CGGCCGCTCT TCGGCGCGTG AAACGGCGGC CCGCGTGGCT
GCCGGCGCGG TGGCGCGGGC GATGTTGCCG GAGATGATGA TCCGGGGTGC GCTCGTGCAG
ATGGGGCCGC ACAAGATCGA CCGCGCCAAC TGGGACTGGA ACGAGGTGGG AAACAACCCC
TTCTGGTGCC CGGACGCAAA GGCAGCGGCG GAATGGGAAA TCTATCTCGA TAGCGTCCGG
AAAGCCGGTT CGTCCTGCGG TGCCGTCATC GAGATTGTAG CCAGCGGCGT ACCCGCCGGT
CTCGGCTCAC CTATCTACGG AAAGCTCGAT GCGGAACTTG CAAGCGCGTT GATGAGCATA
AACGCGGTGA AGGGTGTCGA GATCGGAGAT GGCTTCGGCG CTGCCGCTCT CTCCGGCGAA
GAAAATGCCG ACGAGATGCA GTCGGGGCCG CATGGCATTG AGTTCAGCTC CAATCACGCG
GGCGGCGTTC TTGGCGGCAT TTCCACGGGA CAAGATGTCG TCGCGCGCTT TGCCGTGAAA
CCCACTTCCT CGATCCTCAG CCCTCGCAAA ACAGTCACCA AAGGCGGCGA CGACACGGAA
ATCGTCACCA AGGGCCGCCA TGACCCATGC GTCGGCATCC GCGCCGTGCC TGTGGGCGAA
GCGATGATGG CCTGCGTGCT TGCTGACCAG CTGCTCCGCC ATCGGGCGCA GATGGGCGGG
AGCGGCCGCA ATGAGTGA
 
Protein sequence
MSHNTFGHLF RVTTWGESHG PALGCVVDGA PPRLPLKAED IQQWLDRRKP GQSRFTTQRR 
EPDAVKILSG TFVEDGIEMT TGTPISLMIE NVDQRSKDYG DIVEKFRPGH ADLTYFLKYG
IRDYRGGGRS SARETAARVA AGAVARAMLP EMMIRGALVQ MGPHKIDRAN WDWNEVGNNP
FWCPDAKAAA EWEIYLDSVR KAGSSCGAVI EIVASGVPAG LGSPIYGKLD AELASALMSI
NAVKGVEIGD GFGAAALSGE ENADEMQSGP HGIEFSSNHA GGVLGGISTG QDVVARFAVK
PTSSILSPRK TVTKGGDDTE IVTKGRHDPC VGIRAVPVGE AMMACVLADQ LLRHRAQMGG
SGRNE
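
The two sequences above are printed in blocks of ten characters, so they need whitespace removed before use. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and summarized by GC fraction. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the tables differ in which start codons are permitted); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGTCTCACA ATACGTTCGG CCATCTGTTC"))  # MSHNTFGHLF
print(translate("AGCGGCCGCA ATGAGTGA"))               # SGRNE
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.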