Gene VC0395_A2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2008 
SymbolparE 
ID5137422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2158781 
End bp2160661 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content53% 
IMG OID640533465 
ProductDNA topoisomerase IV subunit B 
Protein accessionYP_001217932 
Protein GI147674220 
COG category[L] Replication, recombination and repair 
COG ID[COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit 
TIGRFAM ID[TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000820641 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC AATATAATGC TGGGGCCATT GAGGTTCTCA ATGGCTTAGA ACCAGTGCGT 
CGCAGACCGG GAATGTATAC CGACACCACG CGCCCCAATC ACCTTGGGCA AGAAGTGATC
GACAACTCCG TCGATGAAGC GCTGGCCGGA TACGCCTCCA AAATTCAGGT AATTCTTCAT
GCCGATCAAT CGCTCGAAGT GATTGATGAC GGGCGAGGCA TGCCCGTCGA TATTCACCCC
GAAGAGAAGG TCTCAGGGGT GGAGCTTATC CTGTGCAAAC TGCACGCCGG TGGTAAGTTT
TCCAACAAAA ACTACCAATT TTCCGGTGGC TTGCATGGGG TGGGTATTTC GGTGGTGAAT
GCGCTCTCCA AACGGGTAGA AGTGGCGGTT CGCCGTGATG GTCAAGTGTA TGAAATTGCC
TTTGAACATG GCGATAAAGT GACTGAGCTG ACGGTTACGG GGACTTGCGG CCGTCGCAAT
ACCGGTACTA GCGTCCATTT CTGGCCAGAC ACCAAATATT TTGACTCACC GAATTTCTCC
GTATCGCGCC TGATTAATAA CTTGCGTGCC AAGGCGGTAC TTTGCCCCGG TTTAGAAATC
ATCTTTACCG ATAAGGTCAA TAACAACGAG CACCGCTGGT TGTACGAAGA TGGTCTAAAA
GATTATCTGG CCGAAGGGGT GAAAGGTTAT ACCTTACTGC CGGAAGAGCC GTTTACCGGC
GAATTTACTG CGCAAACCGA AGCGGCAACG TGGGCGGTGA TTTGGCTACC CGAAGGTGGC
GAGCTCATCA CCGAAAGCTA CGTTAACCTG ATCCCAACTG CGCAAGGTGG TACCCATGTC
AACGGTTTAC GCCAAGGCTT GCTGGATGCG ATGCGCGAGT TTTGTGAGTT CCGCAATCTG
CTGCCACGCG GGGTAAAATT GACAGGCGAG GACGTGTTTG ACCGCTGTGC TTATGTGCTG
TCGATCAAGA TGCAAGATCC GCAGTTTGCT GGCCAAACCA AAGAGCGCCT ATCTTCACGC
CAATCGGCGG CGTTTGTCTC CGGTGTGGTG AAAGATGCCT TTAGCTTGTG GCTTAACGAA
AAACCACAAC TGGCAGAGCA GTTAGCCGAA GTTTGTATTG CTAATGCGCA CAGCCGAATG
CGTGCCAGCA AGAAAGTGGT ACGTAAGAAG ATCGCTTCAG GCCCTGCCTT ACCGGGTAAA
TTAACCGACT GTACAGTGCA AGACCTTAAC CGCACCGAAC TCTTCCTGGT GGAAGGGGAT
TCGGCGGGCG GCTCGGCGAA GCAAGCGCGG GATCGTGAAT TCCAAGCGGT GATGCCACTG
CGCGGCAAAA TCCTCAATAC ATGGGAAGTT TCTGCCGATC AAGTATTGGC TTCACAAGAA
GTACACGACA TCTCAGTTGC ATTGGGGATT GACCCTGACA GCGATAACCT CGAAGCACTG
CGTTACGGCA AGGTCTGTAT CCTTGCCGAT GCGGACTCGG ATGGTCTGCA CATTGCGACC
TTGCTTTGCG CCCTATTTAC CCGCCATTTC CGCGCGCTGG TGAAAGCGGG TCACGTCTAT
GTTGCCATGC CGCCGCTCTA CCGCATCGAC TGTGGTAAAG AGGTGTTCTA CGCACTCGAT
GATCAAGAAA AAGAGGGCGT GTTAGAGCGT CTGAGCCAGA AAAAAGCCAA AGTGAACGTG
CAACGCTTTA AAGGCTTGGG TGAAATGAAC CCGCTGCAAT TGCGTGAAAC CACCATGGAT
CCCAACACTC GCCGCTTGGT GCAACTGACC ATTGATGATG CTGAAGCGAC CGACGAGATG
ATGGATATGC TGCTGGGTAA AAAGCGTGCG GATGACCGCC GAGCTTGGCT GCAACGTAAC
GGCGATATGG CAGAGGTGTA A
 
Protein sequence
MTEQYNAGAI EVLNGLEPVR RRPGMYTDTT RPNHLGQEVI DNSVDEALAG YASKIQVILH 
ADQSLEVIDD GRGMPVDIHP EEKVSGVELI LCKLHAGGKF SNKNYQFSGG LHGVGISVVN
ALSKRVEVAV RRDGQVYEIA FEHGDKVTEL TVTGTCGRRN TGTSVHFWPD TKYFDSPNFS
VSRLINNLRA KAVLCPGLEI IFTDKVNNNE HRWLYEDGLK DYLAEGVKGY TLLPEEPFTG
EFTAQTEAAT WAVIWLPEGG ELITESYVNL IPTAQGGTHV NGLRQGLLDA MREFCEFRNL
LPRGVKLTGE DVFDRCAYVL SIKMQDPQFA GQTKERLSSR QSAAFVSGVV KDAFSLWLNE
KPQLAEQLAE VCIANAHSRM RASKKVVRKK IASGPALPGK LTDCTVQDLN RTELFLVEGD
SAGGSAKQAR DREFQAVMPL RGKILNTWEV SADQVLASQE VHDISVALGI DPDSDNLEAL
RYGKVCILAD ADSDGLHIAT LLCALFTRHF RALVKAGHVY VAMPPLYRID CGKEVFYALD
DQEKEGVLER LSQKKAKVNV QRFKGLGEMN PLQLRETTMD PNTRRLVQLT IDDAEATDEM
MDMLLGKKRA DDRRAWLQRN GDMAEV