Gene EcSMS35_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2124 
SymbolcbpA 
ID6142934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2132855 
End bp2133775 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content53% 
IMG OID641617000 
Productcurved DNA-binding protein CbpA 
Protein accessionYP_001744175 
Protein GI170682205 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.584386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTAA AGGATTATTA CGCCATCATG GGCGTGAAAC CGACGGACGA TCTCAAGACA 
ATCAAGACCG CCTATCGTCG ACTGGCCCGC AAATACCATC CTGATGTCAG CAAAGAACCG
GATGCCGAAG CCCGCTTCAA AGAGGTCGCT GAAGCCTGGG AAGTGTTGAG TGATGAACAA
CGTCGCGCTG AGTATGATCA GATGTGGCAA CATCGCAACG ATCCGCAATT TAGCCGTCAG
TTCCAGCATG GCGACGGTCA GAGTTTTAAC GCCGAAGATT TTGACGATAT CTTCTCGTCA
ATTTTCGGTC AGCATGCCCG CCAGAGCCAT CAACGCCCCG CCACACGCGG CCACGATATT
GAAATCGAAG TGGCGGTATT CCTCGAAGAA ACGCTTACTG AGCATAAGCG TACCATCAGC
TATAACCTGC CGGTTTATAA CGCCTTTGGC ATGATCGAAC AGGAAATTCC GAAAACGCTG
AAAGTGAAGA TCCCGGCAGG GGTCGGCAAT GGTCAACGTA TCCGCCTGAA AGGCCAGGGG
ACGCCGGGCG AAAACGGCGG TCCAAATGGC GATTTGTGGC TGGTGATTCA TATTGCGCCA
CATCCGCTGT TTGATATTGT CGGCCAGGAT CTGGAAATTG TGGTGCCGGT TAGCCCGTGG
GAAGCGGCGC TGGGTGCTAA AGTCACCGTT CCAACACTGA AAGAAAGCAT TTTGCTGACT
ATCCCGCCTG GCAGCCAGGC CGGGCAACGA TTGCGCGTTA AAGGCAAAGG TCTGGTGAGC
AAAAAACAGA CCGGCGATCT GTATGCGGTA CTGAAAATCG TGATGCCGCC GAAACCGGAT
GAAAACACTG CCGCGCTGTG GCAGCAACTG GCAGACGCCC AGTCGTCTTT TGAACCACGT
AAAGATTGGG GGAAAGCATA A
 
Protein sequence
MELKDYYAIM GVKPTDDLKT IKTAYRRLAR KYHPDVSKEP DAEARFKEVA EAWEVLSDEQ 
RRAEYDQMWQ HRNDPQFSRQ FQHGDGQSFN AEDFDDIFSS IFGQHARQSH QRPATRGHDI
EIEVAVFLEE TLTEHKRTIS YNLPVYNAFG MIEQEIPKTL KVKIPAGVGN GQRIRLKGQG
TPGENGGPNG DLWLVIHIAP HPLFDIVGQD LEIVVPVSPW EAALGAKVTV PTLKESILLT
IPPGSQAGQR LRVKGKGLVS KKQTGDLYAV LKIVMPPKPD ENTAALWQQL ADAQSSFEPR
KDWGKA