Gene EcHS_A4292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4292 
SymboldnaB 
ID5595437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4296058 
End bp4297473 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content54% 
IMG OID640923394 
Productreplicative DNA helicase 
Protein accessionYP_001460839 
Protein GI157163521 
COG category[L] Replication, recombination and repair 
COG ID[COG0305] Replicative DNA helicase 
TIGRFAM ID[TIGR00665] replicative DNA helicase 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGAA ATAAACCCTT CAACAAACAG CAGGCTGAAC CCCGCGAACG CGATCCACAA 
GTTGCCGGGC TGAAAGTGCC TCCGCACTCG ATCGAAGCGG AGCAGTCGGT GTTGGGCGGT
TTAATGCTGG ATAACGAACG CTGGGACGAT GTAGCCGAGC GTGTGGTGGC AGACGATTTT
TACACCCGCC CACACCGCCA TATCTTTACT GAAATGGCGC GTTTGCAGGA AAGCGGTAGT
CCTATCGATC TGATTACCCT TGCGGAATCG CTGGAACGCC AGGGGCAACT CGATAGCGTC
GGTGGTTTTG CTTATCTGGC AGAGCTGTCA AAAAATACGC CAAGTGCGGC GAACATCAGT
GCCTATGCGG ACATCGTGCG TGAACGTGCC GTTGTCCGTG AGATGATTTC GGTTGCGAAT
GAGATTGCTG AAGCCGGTTT TGATCCGCAG GGGCGCACCA GCGAAGATCT GCTGGACCTT
GCTGAATCCC GCGTCTTTAA AATTGCTGAA AGTCGTGCAA ACAAAGACGA AGGGCCGAAG
AACATCGCCG ATGTGCTCGA CGCAACCGTG GCGCGTATTG AGCAGTTGTT TCAGCAGCCA
CACGATGGCG TTACCGGAGT AAACACCGGT TATGACGATC TCAACAAAAA AACCGCTGGC
TTGCAGCCGT CGGATTTGAT CATCATCGCC GCGCGTCCGT CGATGGGTAA AACAACATTT
GCGATGAACC TCGTCGAAAA CGCGGCGATG TTGCAGGATA AACCAGTACT TATCTTCTCG
CTGGAGATGC CTTCAGAACA GATTATGATG CGTTCTCTGG CGTCGCTGTC GCGCGTTGAC
CAGACTAAAA TCCGTACCGG GCAGCTCGAT GATGAAGACT GGGCGCGCAT TTCCGGCACC
ATGGGTATTT TGCTCGAAAA ACGCAATATC TATATCGATG ACTCCTCCGG CCTGACGCCA
ACGGAAGTGC GTTCCCGCGC ACGCCGTATT GCCCGTGAAC ACGGCGGCAT CGGGCTTATC
ATGATCGACT ACCTGCAACT GATGCGCGTA CCGGCGCTTT CCGATAACCG TACGCTGGAA
ATTGCAGAAA TCTCTCGCTC GCTGAAAGCA CTGGCGAAAG AACTGAACGT GCCGGTGGTG
GCGCTGTCCC AGTTGAACCG TTCTCTGGAA CAACGTGCCG ACAAACGCCC GGTCAACTCC
GACCTGCGTG AATCTGGCTC TATCGAGCAG GATGCGGACT TGATCATGTT TATCTATCGT
GATGAGGTGT ATCACGAAAA CAGTGATTTA AAAGGCATCG CGGAAATTAT TATCGGTAAA
CAACGTAACG GCCCAATCGG GACGGTACGC CTGACCTTTA ACGGTCAATG GTCGCGCTTC
GACAACTATG CGGGGCCGCA GTACGACGAC GAATAA
 
Protein sequence
MAGNKPFNKQ QAEPRERDPQ VAGLKVPPHS IEAEQSVLGG LMLDNERWDD VAERVVADDF 
YTRPHRHIFT EMARLQESGS PIDLITLAES LERQGQLDSV GGFAYLAELS KNTPSAANIS
AYADIVRERA VVREMISVAN EIAEAGFDPQ GRTSEDLLDL AESRVFKIAE SRANKDEGPK
NIADVLDATV ARIEQLFQQP HDGVTGVNTG YDDLNKKTAG LQPSDLIIIA ARPSMGKTTF
AMNLVENAAM LQDKPVLIFS LEMPSEQIMM RSLASLSRVD QTKIRTGQLD DEDWARISGT
MGILLEKRNI YIDDSSGLTP TEVRSRARRI AREHGGIGLI MIDYLQLMRV PALSDNRTLE
IAEISRSLKA LAKELNVPVV ALSQLNRSLE QRADKRPVNS DLRESGSIEQ DADLIMFIYR
DEVYHENSDL KGIAEIIIGK QRNGPIGTVR LTFNGQWSRF DNYAGPQYDD E