Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0428 |
Symbol | |
ID | 6067714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 461370 |
End bp | 462494 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641599827 |
Product | DNA protecting protein DprA |
Protein accession | YP_001723433 |
Protein GI | 170018479 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0030328 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00135165 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGTATCA GCAGCTTGTA CGGCGATGAT ATGGTCCGTA TCGCTCACTG GGTGGCAAAA CAGTCGCATA TTGATGCGGT TGTATTGCAG CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCACG AAAGAGTATC GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AACCATCATT TAATTCCTGC GGACAGCGAA TTTTATCCTC CTCAACTTCT GGCGACGACA GATTACCCCG GCGCACTGTT TGTTGAAGGA GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG TATGGCGAGC GATGGGGACG ATTATTTTGC GAAACTCTGG CGACGCGTGG AGTGACAATT ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTAGCGCATA AAGCAGCCTT ACAGGTAAAT GGCGTCAGCA TTGCTGTATT GGGGAATGGA CTTAATACCA TTCATCCCCG CCGTCATGCC CGACTGGCTG CCAGTCTGCT TGAACAGGGG GGCGCTCTCG TCTCGGAATT TCCCCTCGAT GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA GGTGTACTGG TGGTGGAAGC GGCTTTGCGT AGTGGTTCGC TGGTGACAGC ACGTTGTGCG CTTGAGCAGG GGCGAGAAGT TTTTGCCTTG CCAGGTCCAA TAGGGAATCC GGGAAGCGAA GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA CCAGATCAGC AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGA GGTAGTTACT CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
|
Protein sequence | MVDTEIWLRL MSISSLYGDD MVRIAHWVAK QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI ESSLCWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW YGERWGRLFC ETLATRGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA RLAASLLEQG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS PDQQDVALPF PELLANVGDE VTPVDVVAER AGQPVPEVVT QLLELELAGW IAAVPGGYVR LRRACHVRRT NVFV
|
| |