Gene Plav_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3356 
Symbol 
ID5454053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3594404 
End bp3595453 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content65% 
IMG OID640878946 
Productaminodeoxychorismate lyase 
Protein accessionYP_001414617 
Protein GI154253793 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.681449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGAACG GGGACGATAC GCCGGACGCG TCTGAGCAAG CCTCTGCGCC GAAGAAGTCG 
CGGTTGGGCC GCTATCTTCT GCTTTCCGCG CTCGTCCTGC CGCTCCTCGC TGCCCTTCTG
GCTGCAAGTA TTTTTCTGTA CGGGAAATAC CGGTTCGAGG CGCACGGGCC GCATGAGGAA
GCCGTCGTCG TCCTGCTTGC GCCCGGTACA GGCGTCCGCG CCATCGCGTC GCTGCTGGAC
CGGGAAGGCG TTATTTCCGA CCCCATGATC TTCCTCGCCG GTGTCCGCTT CCACCGCGCG
GAGGGAGACC TCAAGGCCGG CGAATACCGC ATACCCGCCC ACGCCAGCAT GGCCGCGATC
ATGGGCATTC TGCGCGAAGG CCGCTCGATA CTTCACCGCA TCACCATCCC CGAAGGCTTG
ACCAGCGAGC AGGCAATGCT GCTCGTCGCC GCCAATCCTG TGCTGCTCGG CGAGATGCCG
CCCGTCCCCG CGGAAGGCAA AATACTGCCC GAGACCTACA GCTTCACGCG CGGCGCCACG
CGGGCGGAAA TCGTTGCCGA GATGCAGAAA GCGGCGAGCG ACCTGCTGGA GCGCTTGTGG
GAAGCCCGCG CCGAAAATCT GCCGGTCAAA ACGAAGGAAG AAGCGGTCAT TCTCGCATCC
ATCGTGGAGA AGGAAACAGG CGTCGCTTCC GAGCGTCCCC GCGTCGCGGC CGTCTTCACC
AATCGCCTGC GCAAGCCCAT GCGCCTCCAG TCCGACCCCA CGATCATCTA CGGTCTGGTC
GGAGGCAAAG GCGCTCTGGG CCGTCCGATC CGCCGTAGCG AGCTCGACCG GCTGACCCCC
TATAACACCT ATCTCGTGGA CGGCCTGCCG CCGACGCCCA TCTGCAATCC CGGCAAGGCC
TCGCTCGAAG CGGTGCTCAA TCCCCCCGAT ACCGATGAGT TCTATTTTGT TGCCGATGGC
ACCGGCGGCC ACGCCTTCTC CCGTACGCTG GCCGAACATT TGGAGCGGGT CCGTGAATGG
CGGCAGATCG AGCGCCAGAA GGCGCAGTAG
 
Protein sequence
MTNGDDTPDA SEQASAPKKS RLGRYLLLSA LVLPLLAALL AASIFLYGKY RFEAHGPHEE 
AVVVLLAPGT GVRAIASLLD REGVISDPMI FLAGVRFHRA EGDLKAGEYR IPAHASMAAI
MGILREGRSI LHRITIPEGL TSEQAMLLVA ANPVLLGEMP PVPAEGKILP ETYSFTRGAT
RAEIVAEMQK AASDLLERLW EARAENLPVK TKEEAVILAS IVEKETGVAS ERPRVAAVFT
NRLRKPMRLQ SDPTIIYGLV GGKGALGRPI RRSELDRLTP YNTYLVDGLP PTPICNPGKA
SLEAVLNPPD TDEFYFVADG TGGHAFSRTL AEHLERVREW RQIERQKAQ