Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3325 |
Symbol | parE |
ID | 6146043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3402026 |
End bp | 3403918 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618154 |
Product | DNA topoisomerase IV subunit B |
Protein accession | YP_001745304 |
Protein GI | 170679703 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit |
TIGRFAM ID | [TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.707036 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAA CTTATAACGC TGATGCCATT GAGGTACTCA CCGGGCTTGA GCCGGTTCGC CGCCGTCCGG GGATGTATAC CGATACCACT CGCCCTAACC ATTTGGGGCA AGAAGTTATT GATAACAGTG TCGATGAAGC ACTGGCGGGC CACGCAAAAC GCGTGGATGT AATCTTACAT GCCGACCAGT CGTTAGAAGT GATTGACGAT GGGCGCGGGA TGCCGGTAGA TATTCACCCG GAAGAGGGGG TACCGGCGGT TGAACTGATT CTTTGCCGTC TGCACGCGGG CGGTAAATTC TCTAACAAAA ATTACCAGTT CTCTGGCGGC CTGCATGGCG TGGGGATTTC GGTGGTTAAC GCCCTGTCGA AGCGCGTAGA AGTTAACGTA CGCCGCGATG GTCAGATCTA TAACATCGCC TTTGAAAATG GCGAAAAGGT GCAAGATTTA CAGGTTATCG GCACTTGCGG TAAACGCAAT ACCGGTACCA GCGTGCACTT CTGGCCGGAT GAAACCTTCT TTGACAGCCC GCGTTTTTCT GTTTCACGCC TGACGCATGT GCTGAAAGCC AAAGCGGTAC TGTGCCCTGG TGTTGAGATC ACCTTTAAAG ATGAGATCAA CAACACCGAA CAGCGCTGGT GCTATCAGGA CGGTCTGAAT GATTACCTGG CGGAAGCGGT AAACGGTTTA CCGACGCTGC CAGAAAAACC GTTTATCGGT AATTTCGCTG GCGATACTGA AGCGGTGGAC TGGGCGCTAC TGTGGCTGCC GGAAGGCGGT GAACTGCTGA CCGAAAGCTA CGTCAACCTG ATCCCAACGA TGCAGGGTGG TACGCACGTT AATGGCTTGC GTCAGGGCCT GCTGGATGCG ATGCGCGAGT TCTGTGAATA TCGCAACATT CTGCCGCGCG GCGTGAAGCT GTCGGCGGAA GATATCTGGG ATCGCTGCGC CTATGTGCTG TCAGTAAAAA TGCAGGATCC ACAGTTTGCC GGGCAGACCA AAGAGCGTCT CTCTTCGCGT CAGTGCGCGG CATTTGTTTC TGGCGTGGTG AAAGATGCCT TCACCTTGTG GCTGAACCAG AACGTTCAGG CGGCTGAACT GCTGGCGGAG ATGGCGATTT CCAGCGCCCA GCGCCGTATG CGTGCGGCCA AAAAAGTGGT GCGTAAAAAG CTGACCAGCG GCCCGGCGTT GCCTGGCAAA CTGGCTGACT GTACCGCGCA GGACCTTAAC CGTACCGAGC TGTTCCTTGT GGAAGGTGAC TCCGCAGGCG GATCTGCCAA GCAGGCGCGC GATCGCGAAT ATCAGGCGAT CATGCCACTG AAAGGTAAGA TCCTTAACAC CTGGGAAGTC TCTTCCGACG AAGTGCTGGC TTCGCAGGAA GTGCACGATA TTTCGGTAGC TATCGGTATC GATCCTGACA GCGACGATTT GAGCCAGCTT CGTTACGGCA AGATCTGTAT CCTGGCGGAT GCTGACTCCG ATGGTCTGCA CATTGCCACG CTGCTCTGCG CTTTGTTTGT AAAACACTTC CGCGCATTGG TGAAACACGG TCACGTTTAC GTTGCACTGC CACCGCTCTA CCGTATTGAC CTCGGGAAAG AGGTTTATTA CGCGCTGACG GAAGAAGAGA AAGAGGGCGT ACTTGAGCAA TTAAAACGCA AGAAAGGCAA GCCAAACGTC CAGCGTTTTA AAGGTCTCGG GGAAATGAAC CCGATGCAAT TGCGCGAAAC CACGCTTGAT CCGAACACTC GCCGTCTGGT GCAGTTGACT ATCGATGATG AAGACGATCA GCGTACTGAC GCGATGATGG ATATGCTGCT GGCGAAGAAA CGCTCGGAAG ATCGCCGCAA CTGGTTGCAA GAGAAAGGCG ACATGGCAGA GATTGAGGTC TGA
|
Protein sequence | MTQTYNADAI EVLTGLEPVR RRPGMYTDTT RPNHLGQEVI DNSVDEALAG HAKRVDVILH ADQSLEVIDD GRGMPVDIHP EEGVPAVELI LCRLHAGGKF SNKNYQFSGG LHGVGISVVN ALSKRVEVNV RRDGQIYNIA FENGEKVQDL QVIGTCGKRN TGTSVHFWPD ETFFDSPRFS VSRLTHVLKA KAVLCPGVEI TFKDEINNTE QRWCYQDGLN DYLAEAVNGL PTLPEKPFIG NFAGDTEAVD WALLWLPEGG ELLTESYVNL IPTMQGGTHV NGLRQGLLDA MREFCEYRNI LPRGVKLSAE DIWDRCAYVL SVKMQDPQFA GQTKERLSSR QCAAFVSGVV KDAFTLWLNQ NVQAAELLAE MAISSAQRRM RAAKKVVRKK LTSGPALPGK LADCTAQDLN RTELFLVEGD SAGGSAKQAR DREYQAIMPL KGKILNTWEV SSDEVLASQE VHDISVAIGI DPDSDDLSQL RYGKICILAD ADSDGLHIAT LLCALFVKHF RALVKHGHVY VALPPLYRID LGKEVYYALT EEEKEGVLEQ LKRKKGKPNV QRFKGLGEMN PMQLRETTLD PNTRRLVQLT IDDEDDQRTD AMMDMLLAKK RSEDRRNWLQ EKGDMAEIEV
|
| |