Gene Plav_3340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3340 
Symbol 
ID5455640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3577530 
End bp3579761 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content62% 
IMG OID640878930 
Productorganic solvent tolerance protein 
Protein accessionYP_001414601 
Protein GI154253777 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.491125 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACTTGA GTGAGGGGCT CTGGATGGGT TTGGGCAAGG GGCGCGGCAT CCTGTGGCCG 
ATAGCTGTGC TTATAGCGGT CACGCTGGCC GGGGGGCCGG CCCGCGCGCA AGAGCCCGCC
CCCGCCCGGC AGCAAGCCTT TCCCGAATCC GGCGAAGTGC TGATGCAGGC CGACCAGCTT
TCCTATGACA GGGATGCACG CGTGGTGACG GCGAGCGGCC ATGTGGAGCT TGCCTATGGC
GAGCGGGTGC TGCTTGCCGA CCGCGTCACC TATAACGAGA CGACCGGCGT CGTGACCGCG
GACGGCAATG TGTCGCTGCT CGACCCGCAA GGCGACGTGG CCTTTGCCGA CCACCTGGTG
CTGCGCAATG AAATGCGCGA CGGCGTGATC CAGACACTGC GCGTGCTGCT GAGCGACAAT
TCGCGGCTTG CGGGCCATGA CGTGGTGCGC AGCGGCGGCA ACATGACGAC GCTCCACCGG
GGCGTCTACA GCCCGTGCGA CATCTGCGAG AAAAAAGGAC AGACGACGCC GATCTGGCAG
ATCAAGGCCT TCCGCGTCAT CCACAACAAG GAGAAGCAGA GGATTATCTA TGAAGACGCC
TTCATGGAGT TCTTCGGCGT TCCCGTTTTT TATGTTCCCT ATTTCTCGCA GCCTGACCCG
ACCGTGAAGC GGCAGTCGGG TTTTCTCACG CCTTCCTTCG GTAGCTCTTC GGATCTCGGG
CAGCAGGTGG AAATTCCCTA TTACTGGGCG ATCGCGCCGG ACAAGGACGC GACGATCTCG
CCGCGCTTCA CGAGCAAGGA AGGCACGGTC TATCAGGGCG AATACCGGCA GCGCTTCGAG
AGCGGCCAGT TCGAGATGTT CGGCACGGCC ACCTGGCCGC GCAACCAGCA AGTGGGCACG
CCGGCGGAAA ACGACTTTCG CGGCAGCTTG TTCGGTCAGG GCGATTTCAC GCTCGATGAA
AACTGGCGCT GGGGCTTTCG CTCCGAGCTC GCTTCGGACG ACACCTATCT GCGCCGCTAC
GACCTGTCGT CGGCGACCGA CCTGATCAGC AATGTGAACA CGACCTATAT AGACGGGCGC
AACAGCTTCA CGGCGGACGC CTATTATTTC CGCGGCCTGC TGGCGGCGGA CGACACGGCG
ACCACGCCCT GGGTCGCGCC GCTGATGCAA TATGAGTATT CCTATCCAGA TCAGGTCGCC
GGCGGGCGCA TCGGCTTTTC GGCCAATGCC ATGGTGCTGC AACGCCGCGA AGGCGCGAAG
TCGCGCCGGG TATCAAGCTC CGTCAGATGG GACCGCCGGG AGACATCCGC GAGCGGCTTC
GTCTATCGCC TGTTCGGCAG CCTGCGCGGC GATGTCTATT CGGTCGAAGA CGTGCCGAAC
CCCGCCTTTC CGGCCGCGAC CTTCGACAGC TCGACGATCA CCCGGGCATT GCCGACCATC
GGCACGGAAT GGAGCTATCC TCTCGTCCGC TCAGAGAGCG GCCTGAGGCA GGTGCTGGAG
CCGATCGCGC AGCTCATCTA TTCGCCCAAT GTCGGCAATA CCGAGGAAAT ACCGAACGAG
GATTCGCTGA GCTTCGAGTT CGACGATACC AACCTCTTTT CGGAAGACCG CTTCGTCGGC
TTCGACCGCT GGGAAACGGG GGCGCGGGCC AATCTCGGCG TCCGTTACTC GGTTTACACG
CAGGGAGGCG GCCAGGCGAA CGTGCTGTTT GGCCAGAGCT TTCGAGTGAA CAATAATGAC
AGCGTAGCCG CATCGACCGG GCTGCAGGAC GATACATCGG ATTATGTCGG CCGGGTCATG
GTTGCGCCTT CCGATGATTT TCTGCTGGTC TACCGCTTCC GGCTCGACGA CGAGAACTAC
AAGATCCGGC GCAACGAACT CAACTTCCTC GGCCGGTTCG GCCCGTTGAC CGGCGATATC
GGCTATGCGT ATTTTGCGCC GGACCAGTCC CTGACCTTCC AGGCGCGCGA AGAGGCCTAT
ATCGGCAGCG TGCTGAGGCT CGACCGGTAT TGGCGGGTAT TCGGCCAGAC GCGGCGGGAT
ATCGAAAACG ACCGCACGGT GGCAAACAGG CTCGGAGTAG GCTACGGAGA TGAATGCCTC
GATGTCTCGC TCGGCCTGTA TCAGTCGTTT TACCGGGACA GGGACATAGA ACCGGAAAAT
TCGGTGATCC TTCAGATTAC CTTCAAGACG CTGGGAAGCG CCCAGGTCTC GGGCTCCGCC
GGCTCGAACT AA
 
Protein sequence
MHLSEGLWMG LGKGRGILWP IAVLIAVTLA GGPARAQEPA PARQQAFPES GEVLMQADQL 
SYDRDARVVT ASGHVELAYG ERVLLADRVT YNETTGVVTA DGNVSLLDPQ GDVAFADHLV
LRNEMRDGVI QTLRVLLSDN SRLAGHDVVR SGGNMTTLHR GVYSPCDICE KKGQTTPIWQ
IKAFRVIHNK EKQRIIYEDA FMEFFGVPVF YVPYFSQPDP TVKRQSGFLT PSFGSSSDLG
QQVEIPYYWA IAPDKDATIS PRFTSKEGTV YQGEYRQRFE SGQFEMFGTA TWPRNQQVGT
PAENDFRGSL FGQGDFTLDE NWRWGFRSEL ASDDTYLRRY DLSSATDLIS NVNTTYIDGR
NSFTADAYYF RGLLAADDTA TTPWVAPLMQ YEYSYPDQVA GGRIGFSANA MVLQRREGAK
SRRVSSSVRW DRRETSASGF VYRLFGSLRG DVYSVEDVPN PAFPAATFDS STITRALPTI
GTEWSYPLVR SESGLRQVLE PIAQLIYSPN VGNTEEIPNE DSLSFEFDDT NLFSEDRFVG
FDRWETGARA NLGVRYSVYT QGGGQANVLF GQSFRVNNND SVAASTGLQD DTSDYVGRVM
VAPSDDFLLV YRFRLDDENY KIRRNELNFL GRFGPLTGDI GYAYFAPDQS LTFQAREEAY
IGSVLRLDRY WRVFGQTRRD IENDRTVANR LGVGYGDECL DVSLGLYQSF YRDRDIEPEN
SVILQITFKT LGSAQVSGSA GSN