Gene Plav_3591 details


Gene Information

Locus tag: Plav_3591
Symbol:
ID: 5454997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3836725
End bp: 3838011
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 63%
IMG OID: 640879175
Product: membrane dipeptidase
Protein accession: YP_001414846
Protein GI: 154254022
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID:
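
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3836725
END_BP = 3838011
GENE_LENGTH_BP = 1287
PROTEIN_LENGTH_AA = 428


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the reported field values.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 3836725 to 3838011 covers 1287 bp, and 1287 / 3 = 429 codons, one of which is the stop codon, leaving 428 amino acids.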


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGA ACGGACGGAG ATACGGCATT GCCGCCGCGC TCGCGGTTCT GTTGCTGGCC 
GCCCTCTGGA CGCTGCGCCC CGGTCTCACC CCCGCCGAAG CGGAGCCGAC GCCGGAAGAA
ATCGCCGCCC GCATCCACAA GAGCGCCATC GTCATCGACA CCCATGTCGA TATTCCCTCC
TTCTTCGGCT CCGCCTTGTA CGATCCCGGC CTCCGCAATG CCTATCCCGT CCAGGTCGAT
CTGCCGCGCA TGCGCGAGGG CGGCCTCGAT GCGGCGTTCT TCATCGTCTA TGTCTCGCAG
ACAGAGCGCG GTGCCGTCGG CTATGCGGAA GCTGCGTCCG AAGCATTGGC GAAATTCGCC
GCCATCCGCC GCATGACGGA TATTCAGTAC AAGGACGAGA TCGGCCTTGC GCTCGACGCG
GCGGATGTCC GGCGGCTTCA TGGCGAGGGC AAGCGCATCG CGCTCATCGG CATCGAAAAC
GGCTACTCGG TGGCGAAAGA GCCCGCTCTT CTCGACTTCT ATTATGACCT CGGCGCGCGC
TATTTCGGCC TCGTCCATAA TGGCCACAAC GATCTTTCCG ACAGCGCCCA GCCGCAGGAG
AAATTCGCCG ACAAGCCGAA CGAGGAAGGT GGCGAGCATG ACGGGTTGAG CGAACTCGGC
CGCGCCATGG TCGCGCGCGC AAACGATCTC GGCCTCATGG TCGATGTCTC CCACGCGTCT
CGTGCCGCCG CGCTCGACGC AATCGCCGCC TCCCGCGCAC CCGTCATCGC ATCTCATTCC
TCCGTCCATG CCCTTCGCCC CCATCCGCGC AACATGACGG ATGAGGAAAT GCTGGCGCTG
AAGGAAAAAG GCGGCGTCAT CCAGATCGTC GCTTTCGACG AATATCTCCA TGATGTGCCC
GAGGAGAAAA AGGCCGCCCG GCGCGATCTC GCCGTCTCGC TTGGCCTCAC AAGCCTCGAT
GCCTTCTTCT CGGCGGATGC CGAAACGAAA TCGAAATTCG TCGCGGGCGT TGCCGAGCTC
GACGCAAAAT GGCCGCGCGC CACCGTCGCG ACCCTTGCCG ATCATATCGA CTATGCGGTG
AAGCTCATCG GCATCGACCA TGTCGGCATC GCGTCGGATT TTCAGGGCGG CGGCGGCATC
GAGGGCTGGT CCCATGCGGG CGAAACGGCG AATGTCACCA TCGAACTGGT GCGGCGCGGC
TATGACGAGG AGCAGATCGC AAAGCTCTGG GGCGGCAACC AGCTCCGCGT CATGGAAGCC
GCCGAAAAGG CGCGGAAGGC CAAATAG
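
The gene sequence is printed in blocks of ten bases, so whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and how the database rounds its reported GC content is not stated in the record, so the result may differ slightly from the listed value.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as shown above; paste the full
    # sequence here to evaluate the whole gene.
    first_line = (
        "ATGAAAATGA ACGGACGGAG ATACGGCATT "
        "GCCGCCGCGC TCGCGGTTCT GTTGCTGGCC"
    )
    print(len(clean_sequence(first_line)))
    print(round(gc_percent(first_line), 1))
```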
 
Protein sequence
MKMNGRRYGI AAALAVLLLA ALWTLRPGLT PAEAEPTPEE IAARIHKSAI VIDTHVDIPS 
FFGSALYDPG LRNAYPVQVD LPRMREGGLD AAFFIVYVSQ TERGAVGYAE AASEALAKFA
AIRRMTDIQY KDEIGLALDA ADVRRLHGEG KRIALIGIEN GYSVAKEPAL LDFYYDLGAR
YFGLVHNGHN DLSDSAQPQE KFADKPNEEG GEHDGLSELG RAMVARANDL GLMVDVSHAS
RAAALDAIAA SRAPVIASHS SVHALRPHPR NMTDEEMLAL KEKGGVIQIV AFDEYLHDVP
EEKKAARRDL AVSLGLTSLD AFFSADAETK SKFVAGVAEL DAKWPRATVA TLADHIDYAV
KLIGIDHVGI ASDFQGGGGI EGWSHAGETA NVTIELVRRG YDEEQIAKLW GGNQLRVMEA
AEKARKAK
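
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model), and stops at the first stop codon. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and the final line of the gene sequence shown above.
    print(translate("ATGAAAATGA ACGGACGGAG ATACGGCATT"))
    print(translate("GCCGAAAAGG CGCGGAAGGC CAAATAG"))
```

Translating the first 30 bases gives the first ten residues of the protein, and the final 27 bases give its last eight residues followed by the TAG stop codon.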