Gene EcSMS35_3310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3310 
SymbolparC 
ID6145863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3386493 
End bp3388751 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content55% 
IMG OID641618139 
ProductDNA topoisomerase IV subunit A 
Protein accessionYP_001745289 
Protein GI170681468 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID[TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.31337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0249309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA TGGCAGAGCG CCTTGCGCTA CATGAATTTA CGGAAAACGC CTACTTAAAC 
TACTCCATGT ACGTGATCAT GGACCGTGCG TTGCCGTTTA TTGGTGATGG TCTGAAACCC
GTTCAGCGGC GCATTGTGTA TGCGATGTCT GAACTGGGTC TGAATGCCAG TGCCAAATTT
AAAAAATCGG CCCGTACCGT CGGTGACGTA CTGGGTAAAT ACCATCCGCA CGGCGATATT
GCCTGTTATG GAGCGATGGT CCTGATGGCG CAGCCGTTCT CTTACCGTTA TCCGCTGGTT
GATGGTCAGG GGAACTGGGG CGCGCCGGAC GATCCGAAAT CGTTCGCGGC AATGCGTTAC
ACCGAATCCC GGTTATCGAA ATATTCTGAG TTGCTATTAA GCGAGCTTGG ACAGGGGACG
GCTGACTGGG TGCCAAACTT CGACGGCACC TTACAGGAGC CGAAAATGCT GCCAGCCCGC
CTGCCGAACA TTTTGCTTAA CGGCACCACC GGTATTGCCG TCGGCATGGC GACCGATATT
CCACCGCATA ACCTGCGTGA AGTGGCTCAG GCGGCAATCG CATTAATCGA CCAGCCGAAA
ACCACGCTCG ATCAGCTGCT GGATATCGTG CAGGGGCCGG ATTATCCGAC TGAAGCGGAA
ATTATCACTT CGCGCGCCGA GATTCGTAAA ATCTACGAGA ACGGACGTGG TTCAGTGCGT
ATGCGCGCGG TTTGGAAGAA AGAAGATGGC GCGGTGGTTA TTAGCGCATT GCCGCATCAG
GTTTCAGGTG CGCGCGTACT GGAGCAAATT GCTGCGCAAA TGCGCAACAA AAAGCTGCCG
ATGGTTGACG ATCTGCGCGA TGAATCTGAC CACGAGAACC CGACCCGTCT GGTGATTGTG
CCGCGTTCCA ACCGCGTGGA TATGGATCAG GTGATGAACC ACCTCTTCGC TACCACCGAT
CTGGAAAAGA GCTATCGCAT TAACCTCAAT ATGATCGGTC TGGATGGTCG TCCGGCGGTG
AAAAACCTGC TGGAAATCCT CTCCGAATGG CTGGTGTTCC GCCGCGATAC CGTGCGCCGC
CGACTGAACT ATCGTCTGGA GAAAGTCCTC AAGCGCCTGC ATATCCTCGA AGGTTTGCTG
GTGGCGTTTC TCAATATCGA CGAAGTGATT GAGATCATTC GTAATGAAGA TGAACCGAAA
CCGGCGCTGA TGTCGCGGTT TGGCCTTACG GAAACCCAGG CGGAAGCGAT CCTCGAACTG
AAACTGCGTC ATCTTGCCAA ACTGGAAGAG ATGAAGATTC GCGGTGAGCA GAGTGAGCTG
GAAAAAGAGC GCGACCAGTT GCAGGGCATT TTGGCTTCCG AGCGTAAAAT GAATAACCTG
CTGAAGAAAG AACTGCAGGC AGACGCGCAA GCCTACGGTG ACGAACGTCG TTCGCCGTTG
CAGGAACGCG AAGAAGCGAA AGCGATGAGC GAGCACGACA TGCTGCCGTC TGAACCTGTC
ACCATTGTGC TGTCGCAGAT GGGCTGGGTA CGCAGCGCTA AAGGCCATGA TATCGACGCG
CCGGGCCTGA ATTATAAAGC GGGCGATAGC TTCAAAGCGG CGGTGAAAGG TAAGAGCAAC
CAACCGGTAG TGTTTGTTGA TTCCACCGGT CGTAGCTATG CCATCGACCC GATTACGCTG
CCGTCGGCGC GTGGTCAGGG CGAACCGCTC ACCGGCAAAT TAACGTTGCC GCCTGGAGCG
ACCGTTGACC ATATGCTGAT GGAAAGCGAC GATCAGAAAC TGCTGATGGC TTCCGATGCG
GGTTACGGTT TCGTCTGCAC CTTTAACGAT CTGGTGGCGC GTAACCGTGC AGGTAAGGCT
TTGATCACCT TACCGGAAAA TGCCCATGTT ATGCCGCCGG TGGTGATTGA GGATGCTTCC
GATATGCTGC TGGCAATCAC TCAGGCAGGC CGTATGTTGA TGTTCCCGGT AAGCGATCTG
CCGCAGTTGT CGAAGGGCAA AGGCAATAAG ATTATCAACA TTCCGTCGGC AGAAGCCGCG
CGTGGCGAGG ATGGTCTGGC GCAACTGTAC GTTCTACCAC CGCAAAGCAC GCTGACCATT
CATGTTGGGA AACGCAAAAT TAAACTGCGT CCGGAAGAGC TACAGAAAGT CACTGGCGAA
CGTGGACGCC GCGGTACGTT GATGCGCGGT TTGCAGCGTA TCGATCGTGT TGAGATCGAC
TCTCCTCGCC GTGCCAGCAG CGGTGATAGC GAAGAGTAA
 
Protein sequence
MSDMAERLAL HEFTENAYLN YSMYVIMDRA LPFIGDGLKP VQRRIVYAMS ELGLNASAKF 
KKSARTVGDV LGKYHPHGDI ACYGAMVLMA QPFSYRYPLV DGQGNWGAPD DPKSFAAMRY
TESRLSKYSE LLLSELGQGT ADWVPNFDGT LQEPKMLPAR LPNILLNGTT GIAVGMATDI
PPHNLREVAQ AAIALIDQPK TTLDQLLDIV QGPDYPTEAE IITSRAEIRK IYENGRGSVR
MRAVWKKEDG AVVISALPHQ VSGARVLEQI AAQMRNKKLP MVDDLRDESD HENPTRLVIV
PRSNRVDMDQ VMNHLFATTD LEKSYRINLN MIGLDGRPAV KNLLEILSEW LVFRRDTVRR
RLNYRLEKVL KRLHILEGLL VAFLNIDEVI EIIRNEDEPK PALMSRFGLT ETQAEAILEL
KLRHLAKLEE MKIRGEQSEL EKERDQLQGI LASERKMNNL LKKELQADAQ AYGDERRSPL
QEREEAKAMS EHDMLPSEPV TIVLSQMGWV RSAKGHDIDA PGLNYKAGDS FKAAVKGKSN
QPVVFVDSTG RSYAIDPITL PSARGQGEPL TGKLTLPPGA TVDHMLMESD DQKLLMASDA
GYGFVCTFND LVARNRAGKA LITLPENAHV MPPVVIEDAS DMLLAITQAG RMLMFPVSDL
PQLSKGKGNK IINIPSAEAA RGEDGLAQLY VLPPQSTLTI HVGKRKIKLR PEELQKVTGE
RGRRGTLMRG LQRIDRVEID SPRRASSGDS EE