Gene EcSMS35_4513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4513 
SymboldnaB 
ID6145859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4611881 
End bp4613296 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content54% 
IMG OID641619329 
Productreplicative DNA helicase 
Protein accessionYP_001746441 
Protein GI170683120 
COG category[L] Replication, recombination and repair 
COG ID[COG0305] Replicative DNA helicase 
TIGRFAM ID[TIGR00665] replicative DNA helicase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGAA ATAAACCCTT CAACAAACAG CAGGCTGAAC CCCGCGAACG CGATCCACAA 
GTTGCCGGGC TGAAAGTGCC TCCGCACTCG ATCGAAGCGG AGCAGTCGGT GTTGGGCGGT
TTAATGCTGG ATAACGAACG CTGGGATGAT GTAGCCGAGC GTGTGGTGGC AGACGACTTT
TACACCCGCC CACACCGTCA TATCTTTACT GAAATGGCGC GTTTGCAGGA AAGCGGTAGT
CCTATCGATC TGATTACCCT TGCGGAATCG CTGGAACGCC AGGGGCAACT TGATAGCGTC
GGTGGTTTCG CTTATCTGGC GGAGCTGTCA AAAAATACGC CAAGTGCGGC GAACATCAGT
GCTTATGCTG ACATCGTGCG TGAACGTGCC GTTGTTCGCG AGATGATTTC GGTTGCGAAT
GAGATTGCCG AAGCCGGTTT TGATCCGCAG GGGCGTACCA GCGAAGATCT GCTGGACCTG
GCTGAATCCC GCGTCTTTAA AATTGCCGAA AGTCGTGCAA ACAAAGACGA AGGGCCGAAG
AACATCGCCG ATGTGCTCGA CGCCACGGTG GCGCGTATTG AGCAGTTGTT TCAGCAGCCA
CACGATGGCG TTACCGGGGT AAACACCGGT TATGACGATC TCAACAAAAA AACCGCTGGC
TTGCAGCCGT CGGATTTGAT CATCGTCGCC GCGCGTCCGT CGATGGGTAA AACAACATTT
GCGATGAACC TCGTCGAAAA CGCGGCGATG TTGCAGGATA AACCAGTACT TATCTTCTCG
CTGGAGATGC CATCAGAACA GATTATGATG CGTTCTTTGG CGTCGCTGTC GCGCGTTGAC
CAGACTAAAA TCCGTACCGG GCAGCTCGAT GATGAAGACT GGGCACGTAT TTCCGGCACC
ATGGGTATTT TGCTCGAAAA ACGCAATATC TATATCGATG ACTCCTCCGG CCTGACGCCA
ACGGAAGTGC GTTCCCGCGC ACGCCGTATT GCCCGTGAAC ACGGCGGCAT CGGGCTTATC
ATGATCGACT ACCTGCAACT GATGCGCGTA CCGGCGCTTT CCGATAACCG TACGCTGGAA
ATTGCAGAAA TCTCCCGCTC GCTGAAAGCA CTGGCGAAAG AACTGAACGT GCCGGTGGTG
GCGCTGTCCC AGTTGAACCG TTCTCTGGAA CAACGTGCCG ACAAACGCCC GGTCAACTCC
GACCTGCGTG AATCTGGCTC TATCGAGCAG GATGCGGACT TGATCATGTT TATCTATCGT
GATGAGGTGT ATCACGAAAA CAGTGATTTA AAAGGCATCG CGGAAATTAT TATCGGTAAA
CAACGTAACG GCCCAATCGG GACGGTACGC CTGACCTTTA ACGGTCAATG GTCGCGCTTC
GACAACTATG CGGGGCCGCA GTACGACGAC GAATAA
 
Protein sequence
MAGNKPFNKQ QAEPRERDPQ VAGLKVPPHS IEAEQSVLGG LMLDNERWDD VAERVVADDF 
YTRPHRHIFT EMARLQESGS PIDLITLAES LERQGQLDSV GGFAYLAELS KNTPSAANIS
AYADIVRERA VVREMISVAN EIAEAGFDPQ GRTSEDLLDL AESRVFKIAE SRANKDEGPK
NIADVLDATV ARIEQLFQQP HDGVTGVNTG YDDLNKKTAG LQPSDLIIVA ARPSMGKTTF
AMNLVENAAM LQDKPVLIFS LEMPSEQIMM RSLASLSRVD QTKIRTGQLD DEDWARISGT
MGILLEKRNI YIDDSSGLTP TEVRSRARRI AREHGGIGLI MIDYLQLMRV PALSDNRTLE
IAEISRSLKA LAKELNVPVV ALSQLNRSLE QRADKRPVNS DLRESGSIEQ DADLIMFIYR
DEVYHENSDL KGIAEIIIGK QRNGPIGTVR LTFNGQWSRF DNYAGPQYDD E