Gene EcSMS35_0671 details


Gene Information

Locus tag: EcSMS35_0671
Symbol: hscC
ID: 6146172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 682379
End bp: 684049
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 48%
IMG OID: 641615561
Product: DnaK family protein HscC
Protein accession: YP_001742767
Protein GI: 170683933
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID:
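
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 682379
END_BP = 684049
STOP_CODON_BP = 3  # assumption: the last codon is a stop and is not translated


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_bp - STOP_CODON_BP) // 3


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 1671 556
```

Both results agree with the Gene Length and Protein Length fields of the record.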


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAATG CAGAACTCGC CATTGGTATC GATCTCGGTA CTACCAATAG TTTAATTGCC 
GTCTGGAAAG ACGGTGCCGC GCAATTAATT CCAAATAAGT TCGGTGAATA TTTAACACCA
TCCATAATTA GCATGGATGA AAATAATCAT ATTTTAGTCG GAAAACCGGC TGTATCACGG
CGTACTTCGC ATCCGGATAA AACGGCAGCG TTATTTAAAC GTGCAATGGG CAGTAATACC
AACTGGCGGT TAGGCAGCGA CACATTTAAC GCGCCAGAAC TGTCCTCTTT GGTATTACGC
TCATTAAAAG AAGATGCCGA AGAATTTCTG CAACGTCCGA TTAAAGATGT GGTGATCTCC
GTTCCGGCTT ATTTCAGCGA TGAACAACGC AAGCATACCC GTTTAGCAGC GGAGTTAGCC
GGGTTAAATG CGGTACGCTT AATTAATGAA CCCACAGCAG CTGCGATGGC ATATGGCCTG
CATACGCAAC AAAATACCCG CTCGCTGGTT TTTGATCTCG GTGGCGGCAC GTTTGACGTC
ACGGTGCTCG AATACGCCAC GCCGGTGATT GAAGTTCACG CCTCCGCTGG CGACAACTTT
CTTGGCGGCG AAGATTTTAC TCATATGCTG GTGGATGAAG TTTTAAAACG CGCCGCAGTG
GCTAAAAACA TGCTCAACGA AAGTGAGCTG GCAGCCCTGT ACACCAGCGT GGAAGCGGCA
AAATGTAGCA ATCAATTACC GCTGCAAATT AGCTGGCAGT ATCAGGAAGA AACGCGTGAG
TGCGAATTTT ACGAGAACGA ACTGGAAGAT TTGTGGTTGC CGCTGCTCAA TCGCTTGCGA
GTGCCGATTG AACAGGCGTT GCGCGATGCA CGTCTGAAGC CAAGTCAAAT CGACAGTCTG
GTGCTGGTTG GCGGCGCATC ACAAATGCCG CTGGTGCAGC GAATCGCCGT GCGTCTGTTT
GGCAAATTAC CGTATCAAAG TTACGATCCG AGCACCATTG TCGCGCTGGG CGCAGCAATC
CAGGCCGCCT GCCGCTTACG CAGTGAAGAT ATTGAAGAAG TGATCCTCAC TGATATTTGC
CCCTACTCCT TAGGTGTTGA AGTTAACCGC CAGGGCGTTT CCGGCATTTT CTCACCGATT
ATTGAACGAA ATACCACTGT GCCGGTGTCG CGAGTAGAAA CTTACTCGAC CATGCACCCG
GAGCAGGATT CAATAACGAT TAATGTTTAT CAGGGAGAAA ATCACAAAGT TAAGAATAAT
ATACTGGTTG AATCCTTTGA TGTTCCGCTG AAGAAAACCG GGGCTTATCA GTCTATTGAT
ATTCGCTTTA GTTATGATAT TAACGGATTG CTTGAAGTTG ACGTGCTTCT GGAAGACGGC
AGCGTTAAGT CCAGAGTCAT TAACCACAGC CCGGTAACGT TAAGTACCCA GCAAATTGAA
GAGAGTCGGG CACGGTTATC CGCATTAAAA ATTTATCCGC GCGATATGCT CATCAACCGT
ACCTTTAAAG CCAAACTGGA AGAGTTATGG GCGCGGGCGC TGGGTGACGA GCGAGAAGAG
ATTGGCCGGG TAATCACCGA TTTTGATGCG GCGCTGCAAT CAAACGATAT GGCCCGCGTC
GATGAAGTTC GTCGACGGGC GAGCGATTAT TTGGCCATTG AGATCCCTTA A
 
Protein sequence
MDNAELAIGI DLGTTNSLIA VWKDGAAQLI PNKFGEYLTP SIISMDENNH ILVGKPAVSR 
RTSHPDKTAA LFKRAMGSNT NWRLGSDTFN APELSSLVLR SLKEDAEEFL QRPIKDVVIS
VPAYFSDEQR KHTRLAAELA GLNAVRLINE PTAAAMAYGL HTQQNTRSLV FDLGGGTFDV
TVLEYATPVI EVHASAGDNF LGGEDFTHML VDEVLKRAAV AKNMLNESEL AALYTSVEAA
KCSNQLPLQI SWQYQEETRE CEFYENELED LWLPLLNRLR VPIEQALRDA RLKPSQIDSL
VLVGGASQMP LVQRIAVRLF GKLPYQSYDP STIVALGAAI QAACRLRSED IEEVILTDIC
PYSLGVEVNR QGVSGIFSPI IERNTTVPVS RVETYSTMHP EQDSITINVY QGENHKVKNN
ILVESFDVPL KKTGAYQSID IRFSYDINGL LEVDVLLEDG SVKSRVINHS PVTLSTQQIE
ESRARLSALK IYPRDMLINR TFKAKLEELW ARALGDEREE IGRVITDFDA ALQSNDMARV
DEVRRRASDY LAIEIP
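
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool that produced the record: it hard-codes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons are permitted as starts), and it checks only the first ten and last four codons copied from the sequence above. The names `translate`, `HEAD` and `TAIL` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 12 bases of the gene sequence in this record.
HEAD = "ATGGATAATG CAGAACTCGC CATTGGTATC"
TAIL = "GAGATCCCTTAA"

if __name__ == "__main__":
    print(translate(HEAD))  # prints: MDNAELAIGI
    print(translate(TAIL))  # prints: EIP*
```

The first output matches the opening block of the protein sequence, and the second matches its final three residues followed by the stop codon, which the protein sequence omits.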