Gene EcHS_A3479 details

Gene Information

Locus tag: EcHS_A3479
Symbol: dprA
ID: 5595057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3476779
End bp: 3477903
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 51%
IMG OID: 640922596
Product: DNA protecting protein DprA
Protein accession: YP_001460077
Protein GI: 157162759
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
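
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3476779
end_bp = 3477903

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1125 374"
```

Under these assumptions the computed values agree with the reported gene length (1125 bp) and protein length (374 aa).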


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.000661668
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGTATCA GCAGCTTGTA CGGCGATGAT 
ATGGTCCGTA TCGCTCACTG GGTGGCAAAA CAGTCGCATA TTGATGCGGT TGTATTGCAG
CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCACG AAAGAGTATC
GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AACCATCATT TAATTCCTGC GGACAGCGAA
TTTTATCCTC CTCAACTTCT GGCGACGACA GATTACCCCG GCGCACTGTT TGTTGAAGGA
GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG
TATGGCGAGC GATGGGGACG ATTATTTTGC GAAACTCTGG CGACGCGTGG AGTGACAATT
ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTAGCGCATA AAGCAGCCTT ACAGGTAAAT
GGCGTCAGCA TTGCTGTATT GGGGAATGGA CTTAATACCA TTCATCCCCG CCGTCATGCC
CGACTGGCTG CCAGTCTGCT TGAACAGGGG GGCGCTCTCG TCTCGGAATT TCCCCTCGAT
GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA
GGTGTACTGG TGGTGGAAGC GGCTTTGCGT AGTGGTTCGC TGGTGACAGC ACGTTGTGCG
CTTGAGCAGG GGCGAGAAGT TTTTGCCTTG CCAGGTCCAA TAGGGAATCC GGGAAGCGAA
GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG
GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA
CCAGATCAGC AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG
GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGA GGTAGTTACT
CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA
TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
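
The GC content field can be recomputed from a gene sequence such as the one above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; the record does not say how its 51% figure was derived or rounded, and the function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to group bases into
    blocks of ten) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative input, not the full gene.
print(gc_content("ATGC"))  # prints 50.0
```

To apply it to this record, paste the full gene sequence into a string and compare the result with the reported GC content.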
 
Protein sequence
MVDTEIWLRL MSISSLYGDD MVRIAHWVAK QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI 
ESSLCWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW
YGERWGRLFC ETLATRGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA
RLAASLLEQG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA
LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS
PDQQDVALPF PELLANVGDE VTPVDVVAER AGQPVPEVVT QLLELELAGW IAAVPGGYVR
LRRACHVRRT NVFV
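
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a simplified illustration: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (table 11 differs from the standard table in its permitted start codons, not in this mapping), reads the sequence in frame from its first base, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the inputs shown are only the first and last lines of the gene sequence.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGTCGATA CAGAAATTTG GCTGCGTTTA"))  # prints "MVDTEIWLRL"

# Final 45 bases of the gene sequence, ending in the TGA stop codon.
print(translate("TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA"))  # prints "LRRACHVRRTNVFV"
```

Both fragments reproduce the corresponding start and end of the protein sequence listed above.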