Gene EcSMS35_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4179 
SymboluvrD 
ID6144651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4277633 
End bp4279795 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content58% 
IMG OID641619002 
ProductDNA-dependent helicase II 
Protein accessionYP_001746130 
Protein GI170679582 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID[TIGR01075] DNA helicase II 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.654606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0232596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTTT CTTACCTGCT CGACAGCCTT AATGACAAAC AGCGCGAAGC GGTGGCCGCG 
CCACGCAGCA ACCTTCTGGT GCTGGCGGGC GCGGGCAGTG GTAAGACGCG CGTACTGGTG
CATCGTATCG CCTGGCTGAT GAGCGTGGAA AACTGCTCGC CATACTCGAT TATGGCGGTA
ACGTTTACCA ACAAAGCGGC GGCGGAGATG CGTCATCGTA TCGGGCAGCT GATGGGCACC
AGCCAGGGCG GCATGTGGGT CGGCACCTTC CACGGGCTGG CGCACCGTCT GCTGCGTGCG
CACCATATGG ACGCCAATCT GCCGCAGGAT TTTCAGATCC TCGACAGCGA GGACCAGCTA
CGCCTGCTCA AACGCCTAAT CAAGGCGATG AACCTCGACG AGAAGCAGTG GCCGCCGCGC
CAGGCAATGT GGTACATCAA CAGCCAGAAA GATGAAGGCC TGCGTCCACA TCATATTCAA
AGCTACGGTA ATCCGGTGGA GCAGACCTGG CAGAAGGTGT ATCAGGCGTA TCAGGAAGCG
TGTGACCGCG CGGGGCTGGT GGACTTCGCC GAGCTGCTGC TACGTGCTCA CGAGTTGTGG
CTTAACAAGC CGCATATCCT TCAGCACTAT CGCGAACGGT TTACCAATAT CCTGGTGGAC
GAATTCCAGG ATACCAACAA CATTCAGTAT GCGTGGATCC GCCTGCTGGC GGGCGATACC
GGCAAAGTGA TGATCGTTGG CGATGACGAT CAGTCAATCT ACGGCTGGCG CGGAGCGCAG
GTGGAGAATA TTCAGCGTTT CCTCAATGAT TTCCCCGGTG CTGAAACTAT CCGTCTGGAG
CAGAACTACC GCTCTACCAG CAATATTCTG AGCGCCGCGA ACGCCCTGAT TGAAAACAAC
AACGGGCGTT TGGGTAAAAA ACTGTGGACC GATGGCGCGG ATGGTGAGCC TATTTCCCTC
TATTGCGCTT TTAACGAACT CGATGAAGCG CGTTTTGTGG TTAACCGTAT CAAAACCTGG
CAGGACAACG GCGGGGCGCT TGCCGAGTGC GCCATTCTCT ACCGCAGCAA CGCCCAGTCG
CGTGTGCTGG AAGAAGCACT GTTACAGGCA AGTATGCCGT ACCGTATTTA CGGCGGGATG
CGATTCTTCG AACGCCAGGA AATCAAAGAT GCGCTCTCGT ATCTGCGCCT GATTGCCAAC
CGTAATGACG ACGCGGCCTT TGAGCGCGTG GTGAATACAC CAACGCGGGG TATTGGTGAC
CGGACGCTGG ACGTGGTACG TCAGACATCG CGTGATCGCC AGTTAACACT CTGGCAGGCA
TGTCGTGAAC TGTTGCAGGA AAAAGCCCTC GCCGGACGTG CTGCCAGTGC CTTACAGCGG
TTTATGGAGC TGATCGACGC CTTAGCGCAG GAAACTGCCG ATATGCCGCT GCATGTACAG
ACTGACCGGG TAATTAAAGA CTCCGGCTTG CGCACCATGT ACGAGCAGGA GAAGGGCGAA
AAAGGTCAGA CGCGTATCGA AAACCTGGAG GAACTGGTGA CGGCAACGCG CCAGTTCAGC
TACAACGAAG AAGACGAAGA TTTAATGCCG CTGCAGGCAT TCCTCTCCCA TGCGGCGCTG
GAAGCGGGCG AGGGGCAGGC GGATACCTGG CAGGACGCGG TGCAGTTGAT GACGCTACAC
TCGGCGAAAG GCCTGGAGTT CCCGCAGGTG TTTATCGTTG GTATGGAAGA GGGCATGTTC
CCAAGCCAGA TGTCGCTGGA TGAAGGCGGA CGTCTGGAAG AAGAACGCCG TCTGGCCTAC
GTTGGCGTAA CCCGCGCGAT GCAGAAACTG ACGCTGACCT ACGCGGAAAC TCGCCGTCTG
TATGGCAAAG AGGTTTACCA TCGCCCGTCG CGCTTTATCG GTGAGTTGCC GGAAGAGTGT
GTGGAAGAGG TGCGCCTGCG CGCTACGGTA AGCCGCCCGG TCAGCCATCA ACGGATGGGT
ACGCCGATGG TCGAGAACGA CAGCGGCTAC AAGCTCGGCC AGCGCGTACG CCACGCTAAG
TTTGGTGAAG GCACCATCGT CAATATGGAA GGCAGCGGCG AGCATAGCCG TTTGCAGGTG
GCATTCCAGG GCCAGGGTAT TAAATGGCTG GTGGCGGCTT ATGCCCGACT GGAGACGGTG
TAA
 
Protein sequence
MDVSYLLDSL NDKQREAVAA PRSNLLVLAG AGSGKTRVLV HRIAWLMSVE NCSPYSIMAV 
TFTNKAAAEM RHRIGQLMGT SQGGMWVGTF HGLAHRLLRA HHMDANLPQD FQILDSEDQL
RLLKRLIKAM NLDEKQWPPR QAMWYINSQK DEGLRPHHIQ SYGNPVEQTW QKVYQAYQEA
CDRAGLVDFA ELLLRAHELW LNKPHILQHY RERFTNILVD EFQDTNNIQY AWIRLLAGDT
GKVMIVGDDD QSIYGWRGAQ VENIQRFLND FPGAETIRLE QNYRSTSNIL SAANALIENN
NGRLGKKLWT DGADGEPISL YCAFNELDEA RFVVNRIKTW QDNGGALAEC AILYRSNAQS
RVLEEALLQA SMPYRIYGGM RFFERQEIKD ALSYLRLIAN RNDDAAFERV VNTPTRGIGD
RTLDVVRQTS RDRQLTLWQA CRELLQEKAL AGRAASALQR FMELIDALAQ ETADMPLHVQ
TDRVIKDSGL RTMYEQEKGE KGQTRIENLE ELVTATRQFS YNEEDEDLMP LQAFLSHAAL
EAGEGQADTW QDAVQLMTLH SAKGLEFPQV FIVGMEEGMF PSQMSLDEGG RLEEERRLAY
VGVTRAMQKL TLTYAETRRL YGKEVYHRPS RFIGELPEEC VEEVRLRATV SRPVSHQRMG
TPMVENDSGY KLGQRVRHAK FGEGTIVNME GSGEHSRLQV AFQGQGIKWL VAAYARLETV