Gene EcSMS35_A0059 details

Gene Information

Locus tag: EcSMS35_A0059
Symbol:
ID: 6106587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 46231
End bp: 47592
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 57%
IMG OID: 641614806
Product: hypothetical protein
Protein accession: YP_001739947
Protein GI: 170650805
COG category:
COG ID:
TIGRFAM ID:
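The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with it) and that the protein length excludes the stop codon. The function names are illustrative, not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 46231
END_BP = 47592


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```

Both values agree with the Gene Length and Protein Length fields in the record.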


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.322445
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACATTAG CAGCAATCAC CATGACCGCC CCGGAAGCCG CCAGCCCTGT GCAGATGTAC 
CGCGCGACCT ACTCACCGGA TGACAACAAA CTGCGCCTGT ATGCTGCGTC ACGTCTTGAC
CCGGAGACGT ATAAAAAAGT GCATGATGCC GGTTTTCGCT GGGCACCAAA ACAGGCGCTG
TTTGTGGCTC CAGCCTGGAC ACCGGGCCGG GAAGACGTGC TCCTCTCACT TGCCGGAGAG
ATTGAGGATG AAGACAGCAC GCTCGCTGAA CGTCAGGAAG CACGGGCGGA GCGGTTTACC
GGATACAGCG GAAAGCGGGC CAGTGAATCC GCACAGGCAC TTGATGAAGT GGAAAGACTG
GCCGCGATGA TCCCGCCCGG TCAGCCCATT CTTGTGGGGC ATCACAGCGA ACGCCGCGCC
CGTCGTGATG CGCAGCGTAT TGAAAACGGC ATGAAACGTG CCGTGATGCT CGTTGAACGT
GCGGAATACT GGGAAGAGCG GGCGCGGTCG GCACTGCTTC ACGCGAAGTA TAAAGAACGT
CCGGACGTTC GCTGGCGTCG TATCAAAAAA ATCGAAGCTG ATTTGCGCAA GGCTGAAAAG
ACCATCGCGC AGTCGCAGAA ATATCTGACG ATGTGGCGGG CTGAATCGCT GGATCTGAAT
ATGGCAAAAC TCATCAGCAG TCATGACCAT ATCAGCGCCT GTTTCCCGCT GGATACGTAT
CCGCGCCCGG CAGAAAAAAG CCAGTATGAA GGGAGCCGGT CGTTATGGTC GGCCCTGGAT
GATGACATCA TCACCACGGA GCAGGCCCGC GAAATTGCGA TCCGCTATCA TGAACGGCAG
ATTCAGCATC AGCAACGCTG GGTTAACCAC TATCAGAACC GCCTGATCTA TGAGCGTGCC
ATGCTGGACG AAAGCGGCGG CGTGGTTACC CGGACACAGG ATTTTGAGCC GGGCGGACAG
GTTTTCAGCC GGGGCGAGTG GCTGACCATC ATCCGCGTGA ACAAAAGCAA CGGGGCGGTG
AGTTCAGTCA CAACGCCGAA TTACAGTTTT CTCGGGTACA GCGGCACGAT GAAAGTGACA
CCCGATCGCA TCACGGACTA CAAAGCACCA TCGGCAGAAG AGGCTGCCAT CGCCAGCCAG
GCCGCGAAGC GTCCGCCGGT AGTCAACTAT CCGGGGGAAG GTTTCCGGGA AATGACAAAG
GCACAGTGGG CCGCCCTGCC CCGGGACTGT AAGGCCGTGC GCAGTGTGGC AGAAGCAGAA
GACCACGGGG CATACCGCTA CCGCCGCACA ATGGACAATA ATTTCCGTCT GGTGAATGTG
TATATCACCG ACATGAAAAT TACGGAAATC CCACAGAAAT AA
 
Protein sequence
MTLAAITMTA PEAASPVQMY RATYSPDDNK LRLYAASRLD PETYKKVHDA GFRWAPKQAL 
FVAPAWTPGR EDVLLSLAGE IEDEDSTLAE RQEARAERFT GYSGKRASES AQALDEVERL
AAMIPPGQPI LVGHHSERRA RRDAQRIENG MKRAVMLVER AEYWEERARS ALLHAKYKER
PDVRWRRIKK IEADLRKAEK TIAQSQKYLT MWRAESLDLN MAKLISSHDH ISACFPLDTY
PRPAEKSQYE GSRSLWSALD DDIITTEQAR EIAIRYHERQ IQHQQRWVNH YQNRLIYERA
MLDESGGVVT RTQDFEPGGQ VFSRGEWLTI IRVNKSNGAV SSVTTPNYSF LGYSGTMKVT
PDRITDYKAP SAEEAAIASQ AAKRPPVVNY PGEGFREMTK AQWAALPRDC KAVRSVAEAE
DHGAYRYRRT MDNNFRLVNV YITDMKITEI PQK
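The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below strips the whitespace, computes GC content, and translates the coding sequence. It assumes the sequence contains only A, C, G and T, and it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; only the permitted start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGACATTAG CAGCAATCAC CATGACCGCC CCGGAAGCCG CCAGCCCTGT GCAGATGTAC"
print(translate(clean(first_line)))  # MTLAAITMTAPEAASPVQMY
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to gc_content and translate should allow the GC content and Protein Length fields to be checked in the same way.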