Gene EcolC_4186 details

Gene Information

Locus tag: EcolC_4186
Symbol:
ID: 6067433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4623848
End bp: 4625677
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 55%
IMG OID: 641603614
Product: ATP-dependent DNA helicase RecQ
Protein accession: YP_001727110
Protein GI: 170022156
COG category: [L] Replication, recombination and repair
COG ID: [COG0514] Superfamily II DNA helicase
TIGRFAM ID: [TIGR00614] ATP-dependent DNA helicase, RecQ family
            [TIGR01389] ATP-dependent DNA helicase RecQ
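
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4623848
end_bp = 4625677
gene_length_bp = 1830
protein_length_aa = 609


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1830
print(coding_codons(gene_length_bp))   # 609
```

Under those assumptions both derived values agree with the reported Gene Length and Protein Length.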


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.802851
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCAGG CGGAAGTGTT GAATCTGGAG TCCGGAGCTA AACAGGTTTT ACAAGAAACC 
TTTGGCTACC AACAGTTTCG CCCCGGCCAG GAAGAAATTA TCGACACTGT GCTTTCCGGT
CGCGATTGCC TGGTCGTCAT GCCCACCGGT GGCGGAAAAT CCCTTTGCTA TCAAATTCCT
GCCTTATTGC TAAACGGCCT TACCGTGGTT GTTTCACCGC TGATTTCGTT GATGAAAGAT
CAGGTGGATC AACTGCAAGC CAACGGCGTG GCGGCGGCGT GCCTTAACTC GACGCAAACC
CGCGAACAGC AACTTGAAGT GATGACAGGC TGCCGCACCG GGCAAATTCG CTTACTGTAT
ATCGCCCCGG AACGCCTGAT GCTGGATAAC TTTCTTGAGC ATCTGGCGCA CTGGAATCCG
GTGTTATTAG CCGTCGATGA AGCGCACTGT ATCTCCCAAT GGGGCCACGA TTTCCGCCCG
GAATATGCCG CGCTCGGTCA GTTGCGCCAG CGGTTCCCGA CGCTGCCGTT TATGGCGCTG
ACCGCCACAG CCGACGACAC CACGCGCCAG GATATCGTGC GCCTGCTGGG GCTGATCGAT
CCGCTGATTC AAATCAGCAG TTTTGACCGT CCGAATATTC GCTACATGCT GATGGAGAAG
TTCAAACCGC TCGATCAGTT GATGCGCTAC GTGCAGGAAC AGCGCGGTAA GTCAGGCATT
ATCTACTGCA ACAGCCGCGC GAAAGTAGAA GACACCGCTG CGCGCCTGCA AAGCAAGGGA
ATTAGCGCGG CGGCCTATCA TGCCGGGCTG GAAAATAATG TTCGCGCCGA TGTGCAGGAA
AAATTCCAGC GCGATGACCT GCAAATTGTG GTGGCGACGG TGGCGTTCGG CATGGGCATC
AATAAACCAA ACGTTCGCTT CGTGGTCCAC TTTGATATTC CGCGCAATAT CGAATCCTAT
TATCAGGAAA CCGGACGCGC CGGGCGTGAT GGCCTGCCCG CGGAAGCGAT GCTGTTTTAC
GATCCGGCTG ATATGGCGTG GCTGCGCCGT TGTCTGGAAG AGAAGCCGCA GGGGCAGTTG
CAGGATATCG AGCGCCACAA ACTCAATGCG ATGGGCGCGT TTGCCGAAGC GCAAACTTGC
CGTCGTCTGG TATTGCTGAA CTATTTTGGC GAAGGGCGTC AGGAGCCGTG CGGGAACTGC
GATATCTGCC TCGATCCGCC GAAACAGTAC GACGGTTCAA CCGATGCTCA GATTGCCCTT
TCCACCATTG GTCGTGTGAA TCAGCGGTTT GGGATGGGTT ATGTGGTGGA AGTGATTCGT
GGTGCTAATA ACCAGCGTAT CCGCGACTAT GGTCATGACA AACTGAAAGT CTATGGCATG
GGCCGTGATA AAAGCCATGA ACATTGGGTG AGCGTGATCC GCCAGCTGAT TCACCTCGGC
CTGGTGACGC AAAATATTGC CCAGCATTCT GCCCTACAAC TGACAGAGGC CGCGCGCCCG
GTGCTGCGCG GCGAATCCTC TTTGCAACTT GCCGTGCCGC GTATCGTGGC GCTCAAACCG
AAAGCGATGC AGAAATCGTT CGGCGGCAAC TATGATCGCA AACTGTTCGC CAAATTACGC
AAACTGCGTA AATCGATTGC CGATGAAAGC AATGTCCCGC CGTACGTGGT GTTTAACGAC
GCAACCTTGA TTGAGATGGC TGAACAGATG CCGATCACCG CCAGCGAAAT GCTCAGCGTT
AACGGCGTTG GGATGCGCAA GCTGGAACGC TTTGGTAAAC CGTTTATGGC GCTTATCCGC
GCGCATGTTG ACGGCGACGA CGAAGAGTAG
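
The GC content field reports the share of G and C bases in the gene sequence. A minimal sketch of how such a figure could be computed from the space-separated blocks printed above follows; the helper name `gc_fraction` is hypothetical, and the rounding convention used by the source database is not stated, so it is not reproduced here.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First 10-base block of the gene sequence, used only as a small illustration.
first_block = "GTGGCGCAGG"
print(gc_fraction(first_block))  # 0.8
```

Passing the full gene sequence to the same function gives the gene-wide fraction to compare with the reported value.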
 
Protein sequence
MAQAEVLNLE SGAKQVLQET FGYQQFRPGQ EEIIDTVLSG RDCLVVMPTG GGKSLCYQIP 
ALLLNGLTVV VSPLISLMKD QVDQLQANGV AAACLNSTQT REQQLEVMTG CRTGQIRLLY
IAPERLMLDN FLEHLAHWNP VLLAVDEAHC ISQWGHDFRP EYAALGQLRQ RFPTLPFMAL
TATADDTTRQ DIVRLLGLID PLIQISSFDR PNIRYMLMEK FKPLDQLMRY VQEQRGKSGI
IYCNSRAKVE DTAARLQSKG ISAAAYHAGL ENNVRADVQE KFQRDDLQIV VATVAFGMGI
NKPNVRFVVH FDIPRNIESY YQETGRAGRD GLPAEAMLFY DPADMAWLRR CLEEKPQGQL
QDIERHKLNA MGAFAEAQTC RRLVLLNYFG EGRQEPCGNC DICLDPPKQY DGSTDAQIAL
STIGRVNQRF GMGYVVEVIR GANNQRIRDY GHDKLKVYGM GRDKSHEHWV SVIRQLIHLG
LVTQNIAQHS ALQLTEAARP VLRGESSLQL AVPRIVALKP KAMQKSFGGN YDRKLFAKLR
KLRKSIADES NVPPYVVFND ATLIEMAEQM PITASEMLSV NGVGMRKLER FGKPFMALIR
AHVDGDDEE
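
The gene sequence begins with GTG, which normally encodes valine, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiating codon is translated as methionine. The sketch below illustrates this on the first and last sequence lines of this record. It is a hand-rolled illustration rather than the database's own pipeline; `translate_cds` and its `is_cds_start` parameter are hypothetical names.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino-acid assignments and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str, is_cds_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    If is_cds_start is True and the first codon is a table 11 start codon,
    it is emitted as 'M' regardless of its usual amino acid.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene: GTG is read as the initiator methionine.
print(translate_cds("GTGGCGCAGG CGGAAGTGTT GAATCTGGAG"))  # MAQAEVLNLE
# Last 30 bases of the gene, an internal in-frame fragment ending in TAG.
print(translate_cds("GCGCATGTTG ACGGCGACGA CGAAGAGTAG", is_cds_start=False))  # AHVDGDDEE
```

Both outputs match the corresponding ends of the protein sequence listed above.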