Gene Plav_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1105 
Symbol 
ID5455204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1214387 
End bp1216072 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content64% 
IMG OID640876675 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_001412383 
Protein GI154251559 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.359933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCCCT CGATGCGGAA CGGAACCATG AGCAAGTCCA TAAAAGCCCA GAATAAAGAA 
CGCCTCTACC TCTTCGACAC CACGCTGCGC GACGGCGCGC AGACGACCGG TGTCGATTTC
AGCGTCGAGG ACAAGCGCCT CATCGCCACC CTGCTGGACG AGCTCGGCAT CGACTACATC
GAGGGCGGCT ATCCGGGCGC GAACCCGACC GACACCGCCT TCTTCGGCGA CCGGCCGAAA
CTCGCCCACG CGAAATTCAC CGCCTTCGGC ATGACCAAGC GCGCCGGCCG CAGCGCCTCC
AACGATCCCG GCCTCGCCGC CACGCTCGAC GCCGACACGG ATGCCGTCTG CCTCGTCGCC
AAAAGCTCCT CCTTCCAGGT CGCCGTCGCG CTCGGCATCT CGGAAGAGGA AAACCTCGAA
GGCATCGCCG AAAGCGTGAA GGCCGTCGCC GCCCGCGGCC GCGAGCCGAT GATCGACTGC
GAACATTTCT TCGACGGCTA CAAGCTCAAC CGGGACTACG CGCTCGCCTG CGTCCACGCC
GCGCTCGATA ACGGCGCCCG CTGGGCGGTC CTCTGCGACA CCAATGGTGG CACGCTCCCG
GATGAAGTGG AGCGCATCGT CGGCGAGGTC ATCGCCTCGG GCGTCCCAGG CGAAAAACTC
GGCATCCATG CCCACAACGA CACGGAAAAC GCCGTCGCGA ATTCGCTGGC CGCCATCCGC
GCCGGCGTCC GCCAGGTGCA AGGCACGTTG AACGGGCTGG GCGAGCGGTG CGGCAACGCC
AACATCGTCT CGATCGTCCC GACGCTGCTG CTCAAACCTC TCTATGCGGA TCGTTTCGAA
ATCGGCGTCA CCCATGAGCG CCTGAAAAGC CTCGTCCATG TCTCGCGCAC GCTCGACGAA
ATCCTGAACC GCGCGCCGAA CCGCTACGCC CCTTATGTCG GCGAAAGCGC CTTCGCGTCG
AAAGCGGGCA TCCACGTCTC CGCGATCCTG AAGGACCCGA CGACCTATGA GCATGTGCCG
CCCGAAAGCG TCGGCAACAA GCGCCGGATC GCCGTCTCCG ATCAGGCCGG AAAATCGAAC
ATCCTCGCCC GCCTCGAAGA AGCGGGCATC GCCGTGGACC CGCAGGACCG CCGCATCTCC
CGCCTGCTGG ACGAGGTGAA GGAGCGCGAG TTCCTTGGCT ATTCCTATGA CGGCGCCGAA
GCCTCCTTCG AGCTTCTCGC CCGCCGCATC CTCGGCACCG TGCCGGAATT CTTCGACGTC
GAATCCTTCC GCGTCCTCGT CGAGCGCCGC TACAACGCGA TAGGCGATCT CGTCACCGTC
TCGGAAGCAA CCGTGAAAGT CGTGGTCGAT GGCGAGAAAT TCATCTCGGT GGGCGAGGGT
AACGGCCCCG TCAATGCGCT CGACCAGGCG CTGAGAAAGG ACCTTGGCAA ATACTCGCCC
TATATCGAGG ATCTGAGCCT CGCCGACTTC AAGGTCCGTA TCCTCACCAG CGGCACCGAA
GCCGTCACCC GCGTCATGAT CGAAAGCATG GACCGCGCGG GAGAACGCTG GTTCACGGTC
GGCGTCTCGC CCAACATCGT CGACGCCTCC TTCCAGGCGC TCACGGATTC GATCACCTAC
AAACTCCTCC GCGACAACGC CCCCGTCCCC GCCACGCATA CCCGCAAGGA AAAGGCGGGC
GCCTGA
 
Protein sequence
MRPSMRNGTM SKSIKAQNKE RLYLFDTTLR DGAQTTGVDF SVEDKRLIAT LLDELGIDYI 
EGGYPGANPT DTAFFGDRPK LAHAKFTAFG MTKRAGRSAS NDPGLAATLD ADTDAVCLVA
KSSSFQVAVA LGISEEENLE GIAESVKAVA ARGREPMIDC EHFFDGYKLN RDYALACVHA
ALDNGARWAV LCDTNGGTLP DEVERIVGEV IASGVPGEKL GIHAHNDTEN AVANSLAAIR
AGVRQVQGTL NGLGERCGNA NIVSIVPTLL LKPLYADRFE IGVTHERLKS LVHVSRTLDE
ILNRAPNRYA PYVGESAFAS KAGIHVSAIL KDPTTYEHVP PESVGNKRRI AVSDQAGKSN
ILARLEEAGI AVDPQDRRIS RLLDEVKERE FLGYSYDGAE ASFELLARRI LGTVPEFFDV
ESFRVLVERR YNAIGDLVTV SEATVKVVVD GEKFISVGEG NGPVNALDQA LRKDLGKYSP
YIEDLSLADF KVRILTSGTE AVTRVMIESM DRAGERWFTV GVSPNIVDAS FQALTDSITY
KLLRDNAPVP ATHTRKEKAG A