Gene Plav_3385 details


Gene Information

Locus tag: Plav_3385
Symbol:
ID: 5454402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3623979
End bp: 3626009
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 66%
IMG OID: 640878975
Product: DNA topoisomerase III
Protein accession: YP_001414646
Protein GI: 154253822
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
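
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are inclusive and that the gene length includes one terminal stop codon; the record does not state either convention, and the function name is illustrative only.

```python
def check_cds_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if coordinates, gene length and protein length agree."""
    # Assumption: start and end are inclusive positions on the replicon.
    span = end_bp - start_bp + 1
    # Assumption: the gene length counts one stop codon that is not translated.
    coding_codons = gene_len_bp // 3 - 1
    return (span == gene_len_bp
            and gene_len_bp % 3 == 0
            and coding_codons == protein_len_aa)

# Values taken from the record for Plav_3385.
print(check_cds_lengths(3623979, 3626009, 2031, 676))  # True
```

Under these assumptions the span 3626009 - 3623979 + 1 equals 2031 bp, and 2031 / 3 - 1 equals 676 codons, matching the stated protein length.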


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.191763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.100638
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCTGT TCTTGTGCGA GAAGCCCTCC CAGGGCAAAG ACATTGGCCG GATTCTCGGC 
GCCACGCAGC GCGGTGAAGG CTGCCTCAAC GGTTCCGGCG TCACGGTCAC CTGGTGCATC
GGCCATCTCG TGGAGGCGGC GCCGCCCGAG GCCTACGACG AGCAGCTCAA ACGATGGTCC
GTTGAGCAGT TGCCCATCAT TCCCCAGCAC TGGCGGGTCG AGGTCAAACC GAAGACCGCC
ACGCAATTCA AGGTCGTCAA GGCGCTCTTG GCGAAGGCGA CTCACCTGGT TATCGCCACC
GATGCCGACC GCGAGGGCGA ATTGATCGCC CGCGAGATCG TGGAGCTTTG CGGCTACCGC
GGCCCCATCG AACGCCTGTG GCTGTCGGCG CTCAACGATG CGTCCATTCG GGCGGCACTG
GGCAAGCTGC GGCCTTCGGC CGAGACGCTT TCGATGTACC ACTCGGCGCT GGCGCGCTCC
CGTGCGGATT GGCTCGTGGG CATGAACCTG AGCCGGCTGT TCACGGTGCT GGGGCGACAG
GCGGGTTACG ACGGCGTGCT GTCGGTCGGC CGCGTACAGA CCCCGACGCT CAAGCTCGTT
GTGGACCGCG ACCGCGAGAT CGCGCGCTTC GTGTCCGTAC CATACTGGGC CATCGCCGTG
TCCCTGTTCG CAGGCGGTTC GACTTTCGCC GCGCAATGGG TTCCACCCGA TGCGTGCACC
GACGACGCAG GCCGCTGCCT GCGGCAGCCG GTCGCACAGC AGACCATGCA GCAGATCCGC
GCTGCGGGCA GTGCCCACGT CGTGTCGGTG GAGACTGAGC GTGTCCGCGA AGGCCCGCCG
CTGCCGTTCG ACCTGGGCAC CTTGCAGGAA GTGTGTTCCA AGCAGCTTGG GCTGGACGTG
CAGGAAACCT TGGAGATTGC CCAAGCCCTG TACGAGACGC ACAAGGCCAC GACGTACCCC
CGCTCGGACT CCGGCTACCT GCCCGAGAGC ATGTTTGCCG AAGTGCCCAC CGTTCTCGAC
AGCCTGCTCA AGACCGATCC CTCGCTGCGC TCGATCATGG GCCAGCTCGA CCGCTCGCAG
CGCTCGCGCG CCTGGAACGA TGGCAAGGTC ACGGCGCACC ACGGCATCAT CCCGACGCTC
GAACCGGCGA AGCTCTCCGC CATGAGCGAG AAGGAACTGG CCGTGTACAG GCTCATCCGG
TCGCATTACC TGGCGCAGTT CCTCCCTCAC CACGAGTTCG ACCGCACTGT GGCCAAGTTT
TCGTGCGGGG GGCAGAACCT GGCGGCCACG GGCAAGCAGG TTGTCATCCC GGGTTGGCGC
CAGGTGCTCG CCGAGCCGCA GGCCGAAGAC GGTGATGGCG AGGGCGATAC TGCGGTCCGC
GCCCAGGTGC TGCCCGCGCT GCCGAAACTG TATGAGGGCC TGGCATGCCA GGTGGCCGAC
GTCGATCTCA AGGCACTCAA GACGCTGCCG CCCAAACCGT ACACGCAAGG CGAGTTGGTC
AAGTCCATGA AAGGCGTCGC CAAGCTGGTG TCCGATCCCC GCCTGAAGCA GAAGCTCAAG
GATACGGTTG GCATCGGCAC CGAAGCGACG CGGGCCAACA TCATCGGCGG CCTGATCGCT
CGCGGCTACC TCGTGAAGAA GGGGCGCGCC ATCCGCGCCT CGGATGCGGC TTTCACTTTG
ATCGATGCCG TGCCTGCGGC GATTGCCGAC CCTGGCACCA CCGCCGTCTG GGAACAGGCG
CTCGACATGA TCGAGGCCGG ACAGCTCACC CTGGACGTGT TCATCGGCAA GCAGGCCGCG
TGGATTTCGC AGTTGATTGC GCAGTACGGC AGCGCCTCCC TGTCCATCAA GGTTCCCCAA
GGGCCGGCAT GCCCGCAGTG CGGCGCACCC ACGCGCCAGC GCAGCGGCAA GAGCGGCCCG
TTCTGGTCGT GCAGCCGCTA CCCGGACTGC AAAGGCACGC TGCCAGTCGA ATCCGGCAGC
TCCAAGCGCG GCGCCTCGCG CCCGCGCCGT AGCGGCCGCA AAGGCTCCTG A
 
Protein sequence
MRLFLCEKPS QGKDIGRILG ATQRGEGCLN GSGVTVTWCI GHLVEAAPPE AYDEQLKRWS 
VEQLPIIPQH WRVEVKPKTA TQFKVVKALL AKATHLVIAT DADREGELIA REIVELCGYR
GPIERLWLSA LNDASIRAAL GKLRPSAETL SMYHSALARS RADWLVGMNL SRLFTVLGRQ
AGYDGVLSVG RVQTPTLKLV VDRDREIARF VSVPYWAIAV SLFAGGSTFA AQWVPPDACT
DDAGRCLRQP VAQQTMQQIR AAGSAHVVSV ETERVREGPP LPFDLGTLQE VCSKQLGLDV
QETLEIAQAL YETHKATTYP RSDSGYLPES MFAEVPTVLD SLLKTDPSLR SIMGQLDRSQ
RSRAWNDGKV TAHHGIIPTL EPAKLSAMSE KELAVYRLIR SHYLAQFLPH HEFDRTVAKF
SCGGQNLAAT GKQVVIPGWR QVLAEPQAED GDGEGDTAVR AQVLPALPKL YEGLACQVAD
VDLKALKTLP PKPYTQGELV KSMKGVAKLV SDPRLKQKLK DTVGIGTEAT RANIIGGLIA
RGYLVKKGRA IRASDAAFTL IDAVPAAIAD PGTTAVWEQA LDMIEAGQLT LDVFIGKQAA
WISQLIAQYG SASLSIKVPQ GPACPQCGAP TRQRSGKSGP FWSCSRYPDC KGTLPVESGS
SKRGASRPRR SGRKGS
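
The gene and protein sequences above can be compared with each other by translating the DNA and by computing its GC fraction. The sketch below is a hedged illustration and not the method used to produce this record: it builds the amino-acid assignments of the standard bacterial genetic code (the same assignments as NCBI translation table 11), ignores the special handling of alternative start codons, and uses helper names (`clean`, `gc_fraction`, `translate`) that are hypothetical.

```python
# Codon table built from the conventional TCAG ordering. The amino-acid
# assignments match the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()

def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three 10-base groups of the gene sequence shown above.
first_groups = "ATGCGGCTGT TCTTGTGCGA GAAGCCCTCC"
print(translate(first_groups))  # MRLFLCEKPS
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the results to be compared with the listed protein sequence and the stated GC content.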