Gene EcolC_0428 details


Gene Information

Locus tag: EcolC_0428
Symbol:
ID: 6067714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 461370
End bp: 462494
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 51%
IMG OID: 641599827
Product: DNA protecting protein DprA
Protein accession: YP_001723433
Protein GI: 170018479
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
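
The coordinates and lengths in this record agree with each other if the start and end positions are read as 1-based and inclusive, and if the CDS ends in a single stop codon. A minimal Python sketch of that arithmetic, under those two assumptions (the function names are illustrative and do not come from the source record):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(461370, 462494)
print(length, protein_length_aa(length))  # prints: 1125 374
```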


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0030328
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00135165
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGTATCA GCAGCTTGTA CGGCGATGAT 
ATGGTCCGTA TCGCTCACTG GGTGGCAAAA CAGTCGCATA TTGATGCGGT TGTATTGCAG
CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCACG AAAGAGTATC
GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AACCATCATT TAATTCCTGC GGACAGCGAA
TTTTATCCTC CTCAACTTCT GGCGACGACA GATTACCCCG GCGCACTGTT TGTTGAAGGA
GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG
TATGGCGAGC GATGGGGACG ATTATTTTGC GAAACTCTGG CGACGCGTGG AGTGACAATT
ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTAGCGCATA AAGCAGCCTT ACAGGTAAAT
GGCGTCAGCA TTGCTGTATT GGGGAATGGA CTTAATACCA TTCATCCCCG CCGTCATGCC
CGACTGGCTG CCAGTCTGCT TGAACAGGGG GGCGCTCTCG TCTCGGAATT TCCCCTCGAT
GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA
GGTGTACTGG TGGTGGAAGC GGCTTTGCGT AGTGGTTCGC TGGTGACAGC ACGTTGTGCG
CTTGAGCAGG GGCGAGAAGT TTTTGCCTTG CCAGGTCCAA TAGGGAATCC GGGAAGCGAA
GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG
GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA
CCAGATCAGC AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG
GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGA GGTAGTTACT
CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA
TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
 
Protein sequence
MVDTEIWLRL MSISSLYGDD MVRIAHWVAK QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI 
ESSLCWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW
YGERWGRLFC ETLATRGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA
RLAASLLEQG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA
LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS
PDQQDVALPF PELLANVGDE VTPVDVVAER AGQPVPEVVT QLLELELAGW IAAVPGGYVR
LRRACHVRRT NVFV
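
The gene and protein sequences are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below shows one way to strip the spacing, compute GC content, and translate a coding sequence. Translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as starts, so a standard codon table is used here. Alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 shares
# these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGTATCA GCAGCTTGTA CGGCGATGAT"
print(translate(first_line))  # prints: MVDTEIWLRLMSISSLYGDD
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the reported protein sequence and GC content.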