Gene EcolC_0678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0678 
Symbol 
ID6065587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp728594 
End bp730852 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content55% 
IMG OID641600085 
ProductDNA topoisomerase IV subunit A 
Protein accessionYP_001723681 
Protein GI170018727 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID[TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.508953 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA TGGCAGAGCG CCTTGCGCTA CATGAATTTA CGGAAAACGC CTACTTAAAC 
TACTCCATGT ACGTGATCAT GGACCGTGCG TTGCCGTTTA TTGGTGATGG TCTGAAACCT
GTTCAGCGCC GCATTGTGTA TGCGATGTCT GAACTGGGCC TGAATGCCAG CGCCAAATTT
AAAAAATCGG CCCGTACCGT CGGTGACGTA CTGGGTAAAT ACCATCCGCA CGGCGATAGC
GCCTGTTATG AAGCGATGGT CCTGATGGCG CAACCGTTCT CTTACCGTTA TCCGCTGGTT
GATGGTCAGG GGAACTGGGG CGCGCCGGAC GATCCGAAAT CGTTCGCGGC AATGCGTTAC
ACCGAATCCC GGTTGTCGAA ATATTCCGAG CTGCTATTGA GCGAGCTGGG GCAGGGGACG
GCTGACTGGG TGCCAAACTT CGACGGCACT TTGCAGGAGC CGAAAATGCT ACCTGCCCGT
CTGCCAAACA TTTTGCTTAA CGGCACCACC GGTATTGCCG TCGGCATGGC GACCGATATT
CCACCGCATA ACCTGCGTGA AGTGGCTCAG GCGGCAATCG CATTAATCGA CCAGCCGAAA
ACCACGCTCG ATCAGCTGCT GGATATCGTG CAGGGGCCGG ATTATCCGAC TGAAGCGGAA
ATTATCACTT CGCGCGCCGA GATCCGTAAA ATCTACGAGA ACGGACGTGG TTCAGTGCGT
ATGCGCGCGG TGTGGAAGAA AGAAGATGGC GCGGTGGTTA TCAGCGCATT GCCGCATCAG
GTTTCAGGTG CGCGCGTACT GGAGCAAATT GCTGCGCAAA TGCGCAACAA AAAGCTGCCG
ATGGTTGACG ATCTGCGCGA TGAATCTGAC CACGAGAACC CGACCCGCCT GGTGATTGTG
CCGCGTTCCA ACCGCGTGGA TATGGATCAG GTGATGAACC ACCTCTTCGC TACCACCGAT
CTGGAAAAGA GCTATCGTAT TAACCTTAAT ATGATCGGTC TGGATGGTCG TCCGGCGGTG
AAAAACCTGC TGGAAATCCT CTCCGAATGG CTGGTGTTCC GCCGCGATAC CGTGCGCCGC
CGACTGAACT ATCGTCTGGA GAAAGTCCTC AAGCGCCTGC ATATCCTCGA AGGTTTGCTG
GTGGCGTTTC TCAATATCGA CGAAGTGATT GAGATCATTC GTAATGAAGA TGAACCGAAA
CCGGCGCTGA TGTCGCGGTT TGGCCTTACG GAAACCCAGG CGGAAGCGAT CCTCGAACTG
AAACTGCGTC ATCTTGCCAA ACTGGAAGAG ATGAAGATTC GCGGTGAGCA GAGTGAACTG
GAAAAAGAGC GCGACCAGTT GCAGGGCATT TTGGCTTCCG AGCGTAAAAT GAATAACCTG
CTGAAGAAAG AACTGCAGGC AGACGCGCAA GCCTACGGTG ACGATCGTCG TTCGCCGTTG
CAGGAACGCG AAGAAGCGAA AGCGATGAGC GAGCACGACA TGCTGCCGTC TGAACCTGTC
ACCATTGTGC TGTCGCAGAT GGGCTGGGTA CGCAGCGCTA AAGGCCATGA TATCGACGCG
CCGGGCCTGA ATTATAAAGC GGGTGATAGC TTCAAAGCGG CGGTGAAAGG TAAGAGCAAC
CAACCGGTAG TGTTTGTTGA TTCCACCGGT CGTAGCTATG CCATTGACCC GATTACGCTG
CCGTCGGCGC GTGGTCAGGG CGAGCCGCTC ACCGGCAAAT TAACGTTGCC GCCTGGGGCG
ACCGTTGACC ATATGCTGAT GGAAAGCGAC GATCAGAAAC TGCTGATGGC TTCCGATGCG
GGTTACGGTT TCGTCTGCAC CTTTAACGAT CTGGTGGCGC GTAACCGTGC AGGTAAGGCT
TTGATCACCT TACCGGAAAA TGCCCATGTT ATGCCGCCGG TGGTGATTGA AGATGCTTCC
GATATGCTGC TGGCAATCAC TCAGGCAGGC CGTATGTTGA TGTTCCCGGT AAGTGATCTG
CCGCAGCTGT CGAAGGGCAA AGGCAACAAG ATTATCAACA TTCCATCGGC AGAAGCCGCG
CGTGGAGAAG ATGGTCTGGC GCAATTGTAC GTTCTGCCGC CGCAAAGCAC GCTGACCATT
CATGTTGGGA AACGCAAAAT TAAACTGCGC CCGGAAGAGT TACAGAAAGT CACTGGCGAA
CGTGGACGCC GCGGTACGTT GATGCGCGGT TTGCAGCGTA TCGATCGTGT TGAGATCGAC
TCTCCTCGCC GTGCCAGCAG CGGTGATAGC GAAGAGTAA
 
Protein sequence
MSDMAERLAL HEFTENAYLN YSMYVIMDRA LPFIGDGLKP VQRRIVYAMS ELGLNASAKF 
KKSARTVGDV LGKYHPHGDS ACYEAMVLMA QPFSYRYPLV DGQGNWGAPD DPKSFAAMRY
TESRLSKYSE LLLSELGQGT ADWVPNFDGT LQEPKMLPAR LPNILLNGTT GIAVGMATDI
PPHNLREVAQ AAIALIDQPK TTLDQLLDIV QGPDYPTEAE IITSRAEIRK IYENGRGSVR
MRAVWKKEDG AVVISALPHQ VSGARVLEQI AAQMRNKKLP MVDDLRDESD HENPTRLVIV
PRSNRVDMDQ VMNHLFATTD LEKSYRINLN MIGLDGRPAV KNLLEILSEW LVFRRDTVRR
RLNYRLEKVL KRLHILEGLL VAFLNIDEVI EIIRNEDEPK PALMSRFGLT ETQAEAILEL
KLRHLAKLEE MKIRGEQSEL EKERDQLQGI LASERKMNNL LKKELQADAQ AYGDDRRSPL
QEREEAKAMS EHDMLPSEPV TIVLSQMGWV RSAKGHDIDA PGLNYKAGDS FKAAVKGKSN
QPVVFVDSTG RSYAIDPITL PSARGQGEPL TGKLTLPPGA TVDHMLMESD DQKLLMASDA
GYGFVCTFND LVARNRAGKA LITLPENAHV MPPVVIEDAS DMLLAITQAG RMLMFPVSDL
PQLSKGKGNK IINIPSAEAA RGEDGLAQLY VLPPQSTLTI HVGKRKIKLR PEELQKVTGE
RGRRGTLMRG LQRIDRVEID SPRRASSGDS EE