Gene EcolC_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2354 
Symbol 
ID6065617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2592982 
End bp2595579 
Gene Length2598 bp 
Protein Length865 aa 
Translation table11 
GC content52% 
IMG OID641601757 
ProductDNA topoisomerase I 
Protein accessionYP_001725316 
Protein GI170020362 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.01053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000472675 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGGTAAAG CTCTTGTCAT CGTTGAGTCC CCGGCAAAAG CCAAAACGAT CAACAAGTAT 
CTGGGTAGTG ACTACGTGGT GAAATCCAGC GTCGGTCACA TCCGCGATTT GCCGACCAGT
GGCTCAGCTG CCAAAAAGAG TGCCGACTCT ACCTCCACCA AGACGGCTAA AAAGCCTAAA
AAGGATGAAC GTGGCGCTCT CGTCAACCGT ATGGGGGTTG ACCCGTGGCA CAATTGGGAG
GCGCACTATG AAGTGTTGCC TGGTAAAGAG AAGGTCGTCT CTGAACTGAA ACAACTGGCT
GAAAAAGCCG ACCACATCTA TCTCGCAACC GACCTTGACC GCGAAGGGGA AGCCATTGCA
TGGCACCTGC GGGAAGTGAT TGGGGGTGAT GATGCGCGCT ATAGCCGAGT GGTGTTTAAC
GAAATTACTA AAAACGCGAT CCGCCAGGCA TTTAACAAAC CGGGTGAGCT GAATATTGAT
CGTGTTAATG CCCAGCAGGC GCGTCGCTTT ATGGACCGCG TGGTGGGGTA TATGGTTTCG
CCGCTGCTAT GGAAAAAGAT CGCTCGTGGT CTGTCTGCCG GTCGTGTGCA GTCGGTGGCA
GTCCGCCTGG TGGTCGAGCG TGAGCGTGAA ATTAAAGCGT TCGTGCCGGA AGAGTTCTGG
GAAGTCGATG CCAGCACGAC CACGCCATCT GGTGAAGCGT TGGCGTTGCA GGTGACTCAT
CAGAACGACA AACCGTTCCG TCCGGTCAAC AAAGAACAAA CTCAGGCTGC GGTAAGTCTG
CTGGAAAAAG CGCGCTACAG CGTGCTGGAA CGTGAAGACA AACCGACAAC CAGTAAACCT
GGCGCTCCTT TTATTACCTC TACGCTGCAA CAAGCTGCCA GCACCCGTCT TGGGTTTGGC
GTGAAAAAAA CCATGATGAT GGCGCAGCGT TTGTATGAAG CAGGCTATAT CACTTACATG
CGTACCGACT CCACTAACCT GAGTCAGGAC GCGGTAAATA TGGTTCGCGG TTATATCAGC
GATAATTTTG GTAAGAAATA TCTGCCGGAA AGTCCGAATC AGTACGCCAG CAAAGAAAAC
TCACAGGAAG CGCACGAAGC GATTCGTCCT TCTGACGTCA ATGTGATGGC GGAATCGCTG
AAGGATATGG AAGCAGATGC GCAGAAACTG TACCAGTTAA TCTGGCGTCA GTTCGTTGCC
TGCCAGATGA CCCCAGCGAA ATATGACTCC ACGACGCTGA CCGTTGGTGC GGGCGATTTC
CGCCTGAAAG CACGCGGTCG TATTTTGCGT TTTGATGGCT GGACAAAAGT GATGCCTGCG
TTGCGTAAAG GCGATGAAGA TCGCATCTTA CCAGCAGTTA ATAAAGGCGA TGCTCTGACG
CTCGTTGAAC TGACACCAGC CCAGCACTTT ACCAAGCCGC CAGCCCGTTT CAGTGAAGCA
TCGCTGGTTA AAGAACTGGA AAAACGTGGT ATCGGTCGTC CGTCTACCTA TGCGTCGATC
ATTTCGACCA TTCAGGATCG TGGCTATGTG CGAGTAGAAA ATCGTCGTTT CTATGCGGAA
AAAATGGGCG AAATCGTCAC CGATCGCCTG GAAGAGAATT TCCGCGAGTT AATGAACTAC
GACTTCACCG CGCAGATGGA AAACAGCCTT GACCAGGTGG CAAATCACGA AGCAGAGTGG
AAAGCTGTAC TGGATCACTT CTTCTCGGAT TTCACCCAGC AGTTAGATAA AGCTGAAAAA
GATCCGGAAG AGGGTGGTAT GCGCCCGAAC CAGATGGTTC TGACCAGCAT TGACTGCCCG
ACTTGTGGTC GCAAAATGGG GATTCGCACA GCGAGCACCG GGGTATTCCT TGGCTGTTCT
GGCTATGCGC TGCCGCCGAA AGAGCGTTGC AAAACCACCA TTAACCTGGT GCCGGAAAAC
GAAGTGCTGA ACGTGCTGGA AGGCGAAGAC GCTGAAACCA ACGCGCTGCG CGCAAAACGT
CGTTGCCCGA AATGCGGCAC GGCGATGGAC AGCTATCTCA TCGATCCGAA ACGTAAGTTG
CATGTCTGTG GTAATAACCC AACCTGCGAC GGTTACGAGA TCGAAGAGGG CGAATTCCGC
ATTAAAGGTT ATGACGGCCC GATCGTTGAG TGTGAAAAAT GTGGCTCTGA AATGCACCTG
AAAATGGGGC GTTTCGGTAA ATACATGGCC TGCACCAACG AAGAGTGTAA AAACACACGT
AAGATTTTAC GTAACGGCGA AGTGGCACCA CCGAAAGAAG ATCCGGTGCC ATTACCTGAG
CTGCCGTGCG AAAAATCAGA TGCTTATTTC GTGCTGCGTG ACGGTGCTGC CGGTGTGTTC
CTGGCTGCCA ACACTTTCCC GAAATCGCGT GAAACGCGTG CGCCACTGGT GGAAGAGCTT
TATCGCTTCC GCGACCGTCT GCCGGAAAAA CTGCGTTATC TGGCCGATGC GCCACAGCAG
GATCCGGAAG GTAATAAGAC CATGGTTCGC TTTAGCCGTA AAACCAAACA GCAATATGTC
TCTTCGGAAA AAGACGGAAA GGCGACTGGC TGGTCAGCAT TTTATGTTGA TGGCAAATGG
GTTGAAGGAA AAAAATAA
 
Protein sequence
MGKALVIVES PAKAKTINKY LGSDYVVKSS VGHIRDLPTS GSAAKKSADS TSTKTAKKPK 
KDERGALVNR MGVDPWHNWE AHYEVLPGKE KVVSELKQLA EKADHIYLAT DLDREGEAIA
WHLREVIGGD DARYSRVVFN EITKNAIRQA FNKPGELNID RVNAQQARRF MDRVVGYMVS
PLLWKKIARG LSAGRVQSVA VRLVVERERE IKAFVPEEFW EVDASTTTPS GEALALQVTH
QNDKPFRPVN KEQTQAAVSL LEKARYSVLE REDKPTTSKP GAPFITSTLQ QAASTRLGFG
VKKTMMMAQR LYEAGYITYM RTDSTNLSQD AVNMVRGYIS DNFGKKYLPE SPNQYASKEN
SQEAHEAIRP SDVNVMAESL KDMEADAQKL YQLIWRQFVA CQMTPAKYDS TTLTVGAGDF
RLKARGRILR FDGWTKVMPA LRKGDEDRIL PAVNKGDALT LVELTPAQHF TKPPARFSEA
SLVKELEKRG IGRPSTYASI ISTIQDRGYV RVENRRFYAE KMGEIVTDRL EENFRELMNY
DFTAQMENSL DQVANHEAEW KAVLDHFFSD FTQQLDKAEK DPEEGGMRPN QMVLTSIDCP
TCGRKMGIRT ASTGVFLGCS GYALPPKERC KTTINLVPEN EVLNVLEGED AETNALRAKR
RCPKCGTAMD SYLIDPKRKL HVCGNNPTCD GYEIEEGEFR IKGYDGPIVE CEKCGSEMHL
KMGRFGKYMA CTNEECKNTR KILRNGEVAP PKEDPVPLPE LPCEKSDAYF VLRDGAAGVF
LAANTFPKSR ETRAPLVEEL YRFRDRLPEK LRYLADAPQQ DPEGNKTMVR FSRKTKQQYV
SSEKDGKATG WSAFYVDGKW VEGKK