Gene Information |
Locus tag | EcSMS35_0823 |
Symbol | dinG |
ID | 6147507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 824981 |
End bp | 827131 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615711 |
Product | ATP-dependent DNA helicase DinG |
Protein accession | YP_001742903 |
Protein GI | 170683984 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | |
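The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 824981
END_BP = 827131


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2151 716
```

Under those assumptions the result agrees with the reported Gene Length (2151 bp) and Protein Length (716 aa).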
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCATTAA CCGCCGCGCT TAAAGCGCAA ATTGCCGCCT GGTATAAGGC GCTTCAGGAA CAGATCCCCG ACTTTATTCC CCGTGCGCCG CAGCGGCAGA TGATTGCGGA CGTCGCCAAA ACGCTGGCCG GAGAAGAAGG GCGGCATCTG GCGATTGAAG CCCCCACTGG CGTTGGGAAA ACGCTCTCCT ATTTGATCCC TGGCATCGCC ATTGCCCGCG AAGAGCAAAA AACGCTGGTG GTGAGTACCG CCAACGTGGC ATTGCAGGAT CAGATCTACA GTAAAGATTT ACCGCTGCTG AAAAAGATCA TTCCCGATCT TAAATTCACT GCCGCTTTTG GGCGTGGGCG CTACGTTTGT CCGCGTAATC TGACGGCGCT CGCCAGCACT GAACCCACGC AACAGGATTT GCTGGCGTTT CTTGACGACG AACTGACGCC GAACAATCAG GAAGAACAAA AACGTTGTGC GAAGCTGAAG GGCGATCTCG ACACTTATAA ATGGGATGGT CTGCGTGATC ATACTGATAT CGCTATAGAT GACGATCTCT GGCGTCGTTT GAGCACCGAC AAGGCCAGTT GCCTTAACCG CAACTGTTAT TACTATCGCG AATGCCCGTT TTTTGTCGCT CGTCGGGAGA TTCAGGAAGC GGAAGTGGTG GTGGCAAACC ATGCGCTGGT GATGGCGGCG ATGGAAAGCG AAGCCGTATT GCCTGACCCG AAAAATTTAC TGCTGGTGCT GGACGAAGGT CATCACCTGC CGGATGTGGC GCGGGATGCG CTGGAGATGA GTGCAGAAAT CACCGCACCG TGGTATCGGC TACAGCTGGA CTTGTTCACG AAACTGGTCG CTACCTGCAT GGAGCAGTTT CGCCCGAAGA CCATCCCGCC GCTGGCGATC CCTGAACGTT TGAATGCCCA TTGTGAAGAG TTGTATGAGC TTATCGCCTC GTTAAACAAC ATTCTCAACC TCTACATGCC TGCCGGGCAG GAGGCAGAGC ACCGTTTTGC GATGGGCGAA CTGCCTGATG AAGTACTGGA GATCTGCCAG CGGCTGGCAA AACTCACCGA GATGCTGCGT GGCCTGGCGG AGTTATTTCT TAACGATTTA AGTGAGAAAA CCGGCAGCCA TGACATTGTA CGCCTGCATC GGTTGATTTT GCAGATGAAC CGCGCGTTGG GGATGTTCGA GGCGCAAAGC AAACTCTGGC GGCTGGCGTC TCTGGCGCAA TCTTCCGGTG CGCCGGTGAC CAAATGGGCG ACGCGGGAAG AGCGCGAAGG GCAGCTGCAT CTCTGGTTTC ACTGTGTGGG TATTCGTGTT AGCGACCAAC TGGAAAGGCT GCTGTGGCGC AGTATTCCGC ACATTATTGT TACCTCCGCA ACCTTGCGTT CGCTGAACAG TTTTTCGCGT TTGCAGGAGA TGAGTGGGCT GAAAGAGAAA GCGGGCGACC GTTTTGTGGC GCTGGATTCC CCCTTTAACC ACTGCGAACA GGGCAAAATT GTTATTCCCC GGATGCGCGT TGAGCCTTCC CTCGACAACG AAGAGCAGCA TATTGCCGAA ATGGCGGCCT TTTTCCGTAA GCAGGTGGAG AGCAAAAAAC ATCTCGGTAT GTTGGTACTG TTTGCCAGCG GACGGGCGAT GCAGCGCTTT CTCGACTATG TGACGGATTT ACGTCTGATG TTGCTGGTGC AGGGCGATCA GCCGCGTTAC CGTTTAGTTG AACTGCACCG CAAACGCGTC GCCAACGGTG AGCGCAGCGT GCTGGTGGGC TTACAGTCAT TTGCCGAAGG GCTTGATCTG 
AAAGGCGATC TGCTCAGCCA GGTGCATATC CACAAAATCG CTTTTCCGCC CATCGACAGC CCCGTGGTGA TCACCGAAGG GGAATGGCTG AAAAGCCTCA ACCGCTATCC GTTTGAGGTG CAAAGCCTGC CGAGCGCCTC GTTTAACCTG ATTCAGCAGG TTGGGCGACT GATTCGAAGC CACGGTTGCT GGGGCGAAGT GGTGATTTAC GATAAACGCT TGCTCACCAA AAACTACGGC AAGCGGCTAC TGGATGCATT ACCGGTATTT CCGATAGAGC AACCGGAAGT CCCTGAAGGT ATAGTTAAAA AGAAAGAAAA AACGAAATCC CCACGCCGTC GGCGGCGTTA A
Protein sequence | MALTAALKAQ IAAWYKALQE QIPDFIPRAP QRQMIADVAK TLAGEEGRHL AIEAPTGVGK TLSYLIPGIA IAREEQKTLV VSTANVALQD QIYSKDLPLL KKIIPDLKFT AAFGRGRYVC PRNLTALAST EPTQQDLLAF LDDELTPNNQ EEQKRCAKLK GDLDTYKWDG LRDHTDIAID DDLWRRLSTD KASCLNRNCY YYRECPFFVA RREIQEAEVV VANHALVMAA MESEAVLPDP KNLLLVLDEG HHLPDVARDA LEMSAEITAP WYRLQLDLFT KLVATCMEQF RPKTIPPLAI PERLNAHCEE LYELIASLNN ILNLYMPAGQ EAEHRFAMGE LPDEVLEICQ RLAKLTEMLR GLAELFLNDL SEKTGSHDIV RLHRLILQMN RALGMFEAQS KLWRLASLAQ SSGAPVTKWA TREEREGQLH LWFHCVGIRV SDQLERLLWR SIPHIIVTSA TLRSLNSFSR LQEMSGLKEK AGDRFVALDS PFNHCEQGKI VIPRMRVEPS LDNEEQHIAE MAAFFRKQVE SKKHLGMLVL FASGRAMQRF LDYVTDLRLM LLVQGDQPRY RLVELHRKRV ANGERSVLVG LQSFAEGLDL KGDLLSQVHI HKIAFPPIDS PVVITEGEWL KSLNRYPFEV QSLPSASFNL IQQVGRLIRS HGCWGEVVIY DKRLLTKNYG KRLLDALPVF PIEQPEVPEG IVKKKEKTKS PRRRRR
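Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any downstream use. The following is a minimal sketch of that cleanup, together with a simple GC-fraction calculation; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGGCATTAA CCGCCGCGCT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # prints: 20 0.6
```

Applying the same two helpers to the full Gene sequence field would give a value that can be compared with the GC content reported in the Gene Information table.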