Gene EcSMS35_0670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0670 
Symbol 
ID6144418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp680817 
End bp682268 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content48% 
IMG OID641615560 
ProductDnaJ domain-containing protein 
Protein accessionYP_001742766 
Protein GI170681976 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAT GTTGGCAAAT CCTCGAAATT GAAAGCACGA CGCAAATAGA CATTATCCGC 
CAGGCTTATC TTGCTCGCTT ACCGTTGTGT CATCCCGAAA CCGATCCGCA GGGGTTTAAA
GCATTACGCC AGGCCTATGA AGAGGCGCTG CGACTGGCGG TAAATCCTGT CGAGGAAGCA
GATGATGAAG AAAAAGATGC CGCCGCTGAA CATGAAATAC TACGTGCATT CAGGGCATTA
CTGGATTCTG AAAGTGATCG TTTTCAGCCC TCCGCCTGGC AGAAATTTAT TCAGCAATTA
AATACCTGGA ACATGGAGGA TGTCGATCAA TTACGCTGGC CGCTGTGTGC AGTCGCTATA
GAAGCGCGAT ATCTTTCATT AAATTGTGCT TCTTTGCTGG CAGAGCGTTT GAACTGGCAT
TCGTTTAATG ACAGCGAAGG AATGGATGAG GAAGAAAGGG AGGCTTTTCT TGAGGCCATT
CAGGCTGGTG ATTGTTTCGA TTTCCTTAGC CTTCTGGAAT ATCCCGTCGC GTTGCAGAAC
CAGACTGTTG AGTATTACTT CGCGCTGGAA CGCTGCTGCC GTTACCATCC TGACTATGTC
ACTGCGTTTA TGGCAATGGA AGGTCCGTGG TTCATTCCTG ATGATGTCAA GTTACATCGC
AAACTGTTGC GCTGGTACAG CTCGGTGAAA ACAGGTATGG CGGAACTTAT TCCTGTTGCC
CAACAGTGGC AAGCAGAAAA TCCAGAAAGC GAAGATGCCC GTTATTACCA GTGTGCGCAA
CGGTTGTACT GTGGCGAGGG GGAAAGCCTG CTTGCCGATC TCTGCGCATA CCGGGAGAGT
TACCCATCTA CACAAGCTGA TAATTTGTTG TTGCAGTGGA GCAAGCGTCA TTGCCCGGAT
TATTTCGCGT TATTAGTGAT GGTTATCGAA GCGCGAAGCA TGATAGATGC GCAAGGTCAA
CCGTTGAAAT ATGTTCCTGG TGAGAGCGCC CGGACGCGGC TGTTATGGGC GGAAATTTTA
CATAGCGGAA AATTATCGCC GTTAGGTCAA TCGTTTATTG AGTCGTTATT CTTCAAGCGC
AAAGCATGGG CATGGTGGAA ATCGAGAGTC GGTAGCGAGA CAGAACAAGA TTCACCGTTA
CTGGATTTGT ATCGGGTAGC GGAACAGGTT GTGCTGGAAG CGTTCCCGAA ACAAGAGATG
CTGGCCCGTC TTAATACAAG GCTGGAAGGT GGAGATGCTC ATCCATTAGA GGCCATTATC
ACGCGGATGC TGTTGACGAA AGTGAAACTC GAGCCGGAGG ATGAAGATGT CGATGAGCCA
ACACCTGAAA ACCACGAAGA AAAAAATGAT GAGGATGAAA AACCACAGAG CATTACCAGC
ATTATCAAAA TCAGCTTAAC GGTGCTGGTG ATAGGTTACG CGATCGGTAA AATCGCGATG
TTGTTCAGTT AG
 
Protein sequence
MKTCWQILEI ESTTQIDIIR QAYLARLPLC HPETDPQGFK ALRQAYEEAL RLAVNPVEEA 
DDEEKDAAAE HEILRAFRAL LDSESDRFQP SAWQKFIQQL NTWNMEDVDQ LRWPLCAVAI
EARYLSLNCA SLLAERLNWH SFNDSEGMDE EEREAFLEAI QAGDCFDFLS LLEYPVALQN
QTVEYYFALE RCCRYHPDYV TAFMAMEGPW FIPDDVKLHR KLLRWYSSVK TGMAELIPVA
QQWQAENPES EDARYYQCAQ RLYCGEGESL LADLCAYRES YPSTQADNLL LQWSKRHCPD
YFALLVMVIE ARSMIDAQGQ PLKYVPGESA RTRLLWAEIL HSGKLSPLGQ SFIESLFFKR
KAWAWWKSRV GSETEQDSPL LDLYRVAEQV VLEAFPKQEM LARLNTRLEG GDAHPLEAII
TRMLLTKVKL EPEDEDVDEP TPENHEEKND EDEKPQSITS IIKISLTVLV IGYAIGKIAM
LFS