Gene EcE24377A_3768 details


Gene Information

Locus tag: EcE24377A_3768
Symbol: dprA
ID: 5589820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3762249
End bp: 3763373
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 51%
IMG OID: 640927392
Product: DNA protecting protein DprA
Protein accession: YP_001464753
Protein GI: 157156735
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
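
The coordinates and lengths reported in this record can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single untranslated stop codon; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3762249
end_bp = 3763373
reported_gene_length = 1125      # bp
reported_protein_length = 374    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length matches:", gene_length == reported_gene_length)
print("whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under those assumptions all three checks print True for the values listed here.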


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00137702
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGTCGATA CAGAAATTTG GCTGCGTTTA ATGAGTATCA GCAGCTTGTA CGGCGATGAT 
ATGGTCCGTA TAGCTCACTG GCTGGCAAAA CAGTCGCAAA TTGATGCGGT TGGATTGCAG
CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCGCG AAAGAGTATC
GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AACCATCATT TAATCCCTGC GGACAGCGAA
TTTTATCCTC CTCAACTTCT GATGACGACA GATTACCCCG GTGCACTGTT TGTTGAAGGA
GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG
TATGGCGAGC GATGGGGACG GTTATTTTGC GAAACTCTGG CGACGCGTGG AGTGACAATT
ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTGGCGCATA AAGCTGCCTT ACAGGTAAAT
GGCGTCAGCA TTGCTGTATT GGGGAATGGA CTTAATACCA TTCATCCCCG CCGCCATGCC
CGACTGGCTG CCAGTCTGCT TGAACATGGC GGAGCTCTCG TCTCGGAATT TCCCCTCGAT
GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA
GGTGTACTGG TGGTGGAAGC GGCTTTGCGC AGTGGTTCGC TGGTGACAGC ACGTTGTGCG
CTTGAGCAGG GGCGTGAAGT TTTTGCCTTG CCAGGTCCAA TAGGGAATCC GGGAAGCGAA
GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG
GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA
CCAGATCAGC AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG
GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGA GGTAGTTACT
CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA
TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
 
Protein sequence
MVDTEIWLRL MSISSLYGDD MVRIAHWLAK QSQIDAVGLQ QTGLTLRQAQ RFLSFPRKSI 
ESSLCWLEQP NHHLIPADSE FYPPQLLMTT DYPGALFVEG ELHALHSFQL AVVGSRAHSW
YGERWGRLFC ETLATRGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA
RLAASLLEHG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA
LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS
PDQQDVALPF PELLANVGDE VTPVDVVAER AGQPVPEVVT QLLELELAGW IAAVPGGYVR
LRRACHVRRT NVFV