Gene EcolC_3976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3976 
Symbol 
ID6064512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4368143 
End bp4369558 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content53% 
IMG OID641603389 
Productreplicative DNA helicase 
Protein accessionYP_001726904 
Protein GI170021950 
COG category[L] Replication, recombination and repair 
COG ID[COG0305] Replicative DNA helicase 
TIGRFAM ID[TIGR00665] replicative DNA helicase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.40263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGAA ATAAACCCTT CAACAAACAG CAGGCTGAAC CCCGCGAACG CGATCCACAA 
GTTGCCGGGC TGAAAGTGCC TCCGCACTCG ATCGAAGCGG AGCAGTCGGT GTTGGGCGGT
TTAATGCTAG ATAACGAACG CTGGGATGAT GTAGCCGAGC GTGTGGTAGC AGACGATTTT
TACACCCGCC CACACCGTCA TATCTTTACT GAAATGGCGC GTTTGCAGGA AAGCGGTAGC
CCTATCGATC TAATTACTCT TGCGGAATCG CTGGAACGCC AGGGGCAACT CGATAGCGTC
GGTGGTTTTG CTTATCTGGC AGAGCTGTCA AAAAATACGC CAAGTGCGGC GAACATCAGT
GCTTATGCTG ACATCGTGCG TGAACGTGCC GTTGTCCGTG AGATGATCTC GGTTGCGAAT
GAGATTGCTG AAGCCGGTTT TGATCCGCAG GGGCGTACCA GCGAAGATCT GCTGGACCTT
GCTGAATCCC GCGTCTTTAA AATTGCCGAA AGTCGTGCAA ACAAAGACGA AGGGCCGAAG
AACATCGCCG ATGTGCTCGA CGCAACCGTG GCGCGTATTG AGCAGTTGTT TCAGCAGCCA
CACGATGGCG TTACCGGAGT AAACACCGGT TATGACGATC TCAACAAAAA AACCGCTGGC
TTGCAGCCGT CGGATTTGAT CATCGTCGCC GCGCGTCCGT CGATGGGTAA AACAACATTT
GCGATGAACC TCGTCGAAAA CGCGGCGATG TTGCAGGATA AACCAGTACT TATCTTCTCG
CTGGAGATGC CTTCAGAACA GATTATGATG CGTTCTCTGG CGTCGCTGTC GCGCGTTGAC
CAGACTAAAA TCCGTACCGG GCAGCTCGAT GATGAAGACT GGGCGCGCAT TTCCGGCACC
ATGGGTATTT TGCTCGAAAA ACGCAATATC TATATCGATG ACTCCTCCGG CCTGACGCCA
ACGGAAGTGC GTTCCCGCGC ACGCCGTATT GCCCGTGAAC ACGGCGGCAT CGGGCTTATC
ATGATCGACT ACCTGCAACT GATGCGCGTA CCGGCGCTTT CCGATAACCG TACGCTGGAA
ATTGCAGAAA TCTCTCGCTC GCTGAAAGCA CTGGCGAAAG AACTGAACGT GCCGGTGGTG
GCGCTGTCCC AGTTGAACCG TTCTCTGGAA CAACGTGCCG ACAAACGCCC GGTCAACTCC
GACCTGCGTG AATCTGGCTC TATCGAGCAG GATGCGGACT TGATCATGTT TATCTATCGT
GATGAGGTGT ATCACGAAAA CAGTGATTTA AAAGGCATCG CGGAAATTAT TATCGGTAAA
CAACGTAACG GCCCAATCGG GACGGTACGC CTGACCTTTA ACGGTCAATG GTCGCGCTTC
GACAACTATG CGGGGCCGCA GTACGACGAC GAATAA
 
Protein sequence
MAGNKPFNKQ QAEPRERDPQ VAGLKVPPHS IEAEQSVLGG LMLDNERWDD VAERVVADDF 
YTRPHRHIFT EMARLQESGS PIDLITLAES LERQGQLDSV GGFAYLAELS KNTPSAANIS
AYADIVRERA VVREMISVAN EIAEAGFDPQ GRTSEDLLDL AESRVFKIAE SRANKDEGPK
NIADVLDATV ARIEQLFQQP HDGVTGVNTG YDDLNKKTAG LQPSDLIIVA ARPSMGKTTF
AMNLVENAAM LQDKPVLIFS LEMPSEQIMM RSLASLSRVD QTKIRTGQLD DEDWARISGT
MGILLEKRNI YIDDSSGLTP TEVRSRARRI AREHGGIGLI MIDYLQLMRV PALSDNRTLE
IAEISRSLKA LAKELNVPVV ALSQLNRSLE QRADKRPVNS DLRESGSIEQ DADLIMFIYR
DEVYHENSDL KGIAEIIIGK QRNGPIGTVR LTFNGQWSRF DNYAGPQYDD E