Gene EcSMS35_1533 details

Gene Information

Locus tag: EcSMS35_1533
Symbol:
ID: 6143700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1516732
End bp: 1517988
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 51%
IMG OID: 641616410
Product: hypothetical protein
Protein accession: YP_001743588
Protein GI: 170682120
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID:
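
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon that is not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record for EcSMS35_1533.
length = gene_length_bp(1516732, 1517988)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Both results agree with the Gene Length and Protein Length fields of the record.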


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000116235
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00188577
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGATCTG ATGCGAAAAA CTTGATGAGC GACGGGAACG TGCAAATTGT TAAGACCGGC 
GAGGTCATTG GCGCGACGCA ACTTACTGAA GGCGAGTTAA TTGTTGAGGC TGGCGGAAGA
GCCGAAAATA CCGTGGTCAC GGGGGCTGGC TGGTTGAAAG TGGCAACCGG TGGGATCGCC
AAATGCACAC AGTACGGCAA CAATGGCACG CTATCGGTCA GCGACGGTGC CATTGCCACA
GATATTGTTC AGTCCGAGGG AGGCGCAATT AGTCTCTCTA CGCTCGCTAC GGTTAATGGC
CGCCATCCCG AAGGTGAATT CAGCGTTGAT AAAGGTTATG CCTGCGGTTT GTTGCTGGAA
AATGGCGGTA ACCTGCGTGT ACTGGAAGGC CATCGCGCGG AAAAAATTAT TCTCGATCAA
GAGGGTGGCC TGTTGGTCAA TGGGACAACC TCAGCGGTCG TGGTAGATGA AGGTGGTGAA
TTGTTGGTGT ATCCAGGTGG GGAAGCCAGC AATTGTGAGA TTAATCAGGG CGGCGTTTTT
ATGCTGGCGG GGAAAGCCAA TGATACGTTG CTTGCTGGTG GCACCATGAA TAATCTCGGT
GGTGAAGACT CTGACACTAT TGTTGAGAAT GGAGCCATCT ATCGTCTGGG GACGGATGGT
CTTCAGCTCT ACAGTTCCGG TAAGACGCAA AACCTGTCCG TTAATGTGGG TGGTCGGGCT
GAAGTGCATG CCGGTACGCT GGAAAATGCG GTAATACAAG GTGGGACAGT GATCCTGTTG
TCACCCACCA GCGCGGACGA AAATTTTGTC GTAGAGGAAG ATCGCGCACC GGTTGAACTG
ACCGGTAGTG TTGCATTACT GGACGGCGCT TCAATGATTA TTGGCTATGG CGCAGATCTG
CAACAATCAA CGATTACTGT ACAGCAGGGC GGTGTATTGA TTCTCGACGG CAGTACGATA
AAAGGTGACA GTGTCACTTT CAGTGTTGGT AACATCAATC TCAATGGCGG AAAACTGTGG
CTAATCACTG GTGCGGCAAC GCATGTGCAA CTTAAAGTGA AACGCCTGCG CGGAGAGGGA
GCGATTTGCC TGCAAACCAG TGCGAAAGAA ATTTCACCTG ACTTCATCAA TGTGAAAGGG
GAAGTTACTG GTGATATACA CGTTGAGATA ACAGATGCCA GTCGGCAAAC TCTGTGTAAC
GCACTGAAAC TACAGCCAGA CGAAGACGGG ATTGGCGCAA CGCTCCAGCC TGCGTAA
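
The gene sequence is displayed in blocks of ten bases, so the layout whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line of the sequence is used as input, so the result is not the GC content of the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence only (an excerpt, not the full gene).
first_line = "ATGGGATCTG ATGCGAAAAA CTTGATGAGC GACGGGAACG TGCAAATTGT TAAGACCGGC"
excerpt = clean_sequence(first_line)
print(len(excerpt))         # 60
print(gc_percent(excerpt))  # GC percentage of the excerpt
```

Applying the same two helpers to all displayed lines joined together would give a value to compare against the GC content field of the record.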
 
Protein sequence
MGSDAKNLMS DGNVQIVKTG EVIGATQLTE GELIVEAGGR AENTVVTGAG WLKVATGGIA 
KCTQYGNNGT LSVSDGAIAT DIVQSEGGAI SLSTLATVNG RHPEGEFSVD KGYACGLLLE
NGGNLRVLEG HRAEKIILDQ EGGLLVNGTT SAVVVDEGGE LLVYPGGEAS NCEINQGGVF
MLAGKANDTL LAGGTMNNLG GEDSDTIVEN GAIYRLGTDG LQLYSSGKTQ NLSVNVGGRA
EVHAGTLENA VIQGGTVILL SPTSADENFV VEEDRAPVEL TGSVALLDGA SMIIGYGADL
QQSTITVQQG GVLILDGSTI KGDSVTFSVG NINLNGGKLW LITGAATHVQ LKVKRLRGEG
AICLQTSAKE ISPDFINVKG EVTGDIHVEI TDASRQTLCN ALKLQPDEDG IGATLQPA
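
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in their permitted start codons, which this sketch does not model); `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring display whitespace.

    Trailing stop codons are dropped. Alternative start codons are
    translated literally rather than as methionine.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# First 30 bases of the gene sequence and its last 9 bases.
print(translate("ATGGGATCTG ATGCGAAAAA CTTGATGAGC"))  # MGSDAKNLMS
print(translate("CCTGCGTAA"))                         # PA
```

The two fragments reproduce the first ten residues and the last two residues of the protein sequence shown in the record.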