Gene EcolC_2864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2864 
Symbol 
ID6065246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3130854 
End bp3132875 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content54% 
IMG OID641602270 
Productexcinuclease ABC subunit B 
Protein accessionYP_001725819 
Protein GI170020865 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000193006 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0188114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAC CGTTCAAACT GAATTCCGCT TTTAAACCTT CTGGCGATCA GCCAGAGGCG 
ATTCGACGTC TCGAAGAGGG GCTGGAAGAT GGCCTGGCGC ACCAGACGTT ACTTGGCGTG
ACTGGCTCAG GGAAAACCTT CACCATTGCC AATGTCATTG CTGACCTTCA GCGCCCAACC
ATGGTACTTG CGCCCAACAA AACGCTGGCG GCCCAGCTGT ATGGCGAAAT GAAAGAGTTC
TTCCCGGAAA ACGCGGTGGA ATATTTCGTT TCCTACTACG ACTACTATCA GCCGGAAGCC
TATGTACCGA GTTCCGACAC TTTCATTGAG AAAGATGCCT CGGTTAACGA ACATATTGAG
CAGATGCGTT TGTCCGCCAC CAAAGCGATG CTGGAGCGGC GTGATGTGGT TGTGGTGGCG
TCTGTTTCCG CGATTTATGG TCTGGGCGAT CCTGATTTAT ATCTCAAGAT GATGCTCCAT
CTCACGGTCG GTATGATTAT CGATCAGCGC GCGATTCTGC GCCGACTGGC GGAGCTGCAA
TACGCTCGTA ATGATCAAGC ATTCCAGCGT GGTACTTTCC GCGTTCGTGG CGAGGTGATA
GATATCTTCC CGGCAGAATC GGATGACATT GCACTTCGCG TGGAACTGTT TGACGAGGAA
GTGGAACGAT TGTCGTTATT TGACCCGCTG ACCGGGCAGA TTGTTTCCAC TATTCCACGT
TTTACCATCT ACCCGAAAAC GCACTACGTC ACACCGCGCG AGCGCATCGT ACAGGCGATG
GAGGAGATCA AAGAAGAGCT GGCCGCCAGA CGCAAAGTGC TGTTGGAAAA CAACAAACTG
CTGGAAGAGC AGCGGCTGAC CCAGCGTACC CAGTTTGATC TGGAGATGAT GAACGAGCTG
GGCTACTGTT CGGGGATTGA AAACTACTCG CGCTTCCTCT CCGGTCGTGG ACCGGGTGAG
CCACCGCCGA CGCTGTTTGA TTACCTGCCT GCCGATGGGC TGCTGGTCGT CGATGAATCT
CACGTCACCA TTCCACAAAT TGGCGGCATG TATCGCGGTG ACCGGGCGCG TAAAGAGACA
CTGGTGGAGT ACGGCTTCCG CCTGCCATCA GCGCTGGATA ACCGTCCGCT TAAGTTTGAA
GAGTTCGAAG CATTAGCGCC GCAAACCATC TATGTTTCGG CGACGCCGGG TAATTACGAG
CTGGAAAAAT CCGGCGGCGA TGTGGTGGAT CAGGTGGTGC GTCCAACCGG ATTGCTTGAC
CCGATTATCG AAGTGCGGCC GGTGGCGACA CAGGTTGATG ATCTTCTTTC GGAGATTCGT
CAGCGAGCGG CAATTAACGA ACGCGTACTG GTCACCACAC TGACCAAGCG GATGGCGGAA
GATCTTACCG AATATCTCGA AGAACATGGC GAGCGCGTGC GTTATCTTCA CTCAGATATC
GACACCGTCG AACGTATGGA GATTATCCGC GACTTGCGTC TGGGTGAGTT CGACGTGCTG
GTAGGGATCA ACTTACTGCG CGAAGGTCTG GATATGCCGG AAGTGTCGCT GGTGGCGATC
CTCGACGCTG ACAAAGAAGG CTTCCTGCGT TCCGAACGTT CGTTGATCCA GACCATTGGT
CGTGCGGCAC GTAACGTTAA CGGTAAAGCG ATTCTCTACG GCGATAAGAT CACCCCATCA
ATGGCGAAAG CGATTGGCGA AACCGAACGT CGCCGTGAGA AACAGCAGAA GTACAACGAG
GAACACGGAA TTACGCCGCA AGGCTTGAAC AAGAAAGTGG TCGATATCCT GGCGCTGGGG
CAGAACATTG CCAAAACCAA AGCGAAGGGC AGAGGAAAAT CGCGCCCGAT TGTTGAGCCG
GATAATGTGC CGATGGATAT GTCGCCTAAA GCGTTGCAGC AGAAAATCCA TGAGCTGGAA
GGGTTGATGA TGCAACACGC GCAGAATCTG GAGTTCGAAG AAGCGGCGCA AATTCGTGAC
CAGTTGCATC AGCTGCGTGA GCTGTTTATC GCGGCATCGT AA
 
Protein sequence
MSKPFKLNSA FKPSGDQPEA IRRLEEGLED GLAHQTLLGV TGSGKTFTIA NVIADLQRPT 
MVLAPNKTLA AQLYGEMKEF FPENAVEYFV SYYDYYQPEA YVPSSDTFIE KDASVNEHIE
QMRLSATKAM LERRDVVVVA SVSAIYGLGD PDLYLKMMLH LTVGMIIDQR AILRRLAELQ
YARNDQAFQR GTFRVRGEVI DIFPAESDDI ALRVELFDEE VERLSLFDPL TGQIVSTIPR
FTIYPKTHYV TPRERIVQAM EEIKEELAAR RKVLLENNKL LEEQRLTQRT QFDLEMMNEL
GYCSGIENYS RFLSGRGPGE PPPTLFDYLP ADGLLVVDES HVTIPQIGGM YRGDRARKET
LVEYGFRLPS ALDNRPLKFE EFEALAPQTI YVSATPGNYE LEKSGGDVVD QVVRPTGLLD
PIIEVRPVAT QVDDLLSEIR QRAAINERVL VTTLTKRMAE DLTEYLEEHG ERVRYLHSDI
DTVERMEIIR DLRLGEFDVL VGINLLREGL DMPEVSLVAI LDADKEGFLR SERSLIQTIG
RAARNVNGKA ILYGDKITPS MAKAIGETER RREKQQKYNE EHGITPQGLN KKVVDILALG
QNIAKTKAKG RGKSRPIVEP DNVPMDMSPK ALQQKIHELE GLMMQHAQNL EFEEAAQIRD
QLHQLRELFI AAS