Gene EcSMS35_0533 details

Gene Information

Locus tag: EcSMS35_0533
Symbol:
ID: 6144759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 542796
End bp: 543812
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 48%
IMG OID: 641615427
Product: hypothetical protein
Protein accession: YP_001742634
Protein GI: 170684227
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [W] Extracellular structures
COG ID: [COG5295] Autotransporter adhesin
TIGRFAM ID:
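
The start, end, and length fields of this record are mutually consistent if the coordinates are read as 1-based and inclusive, and if the protein length excludes the terminal stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(542796, 543812)
print(length, protein_length(length))  # 1017 338
```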


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACTG TAAACGTAGC TTTACTGGCA CTCATAATTT CAGCAATATC CAGCCCTGTT 
GTTTTAGCTG GTGATACCAT TGAAGCGGCG GCAACAGAGC TTTCAGCCAT TAACTCTGGC
ATGTCGCAAT CGGAGATTGA GCAGAAGATT ACCCGCTTTT TAGAACGCAC AGACAACAGC
CCCGCTGCGT ATACCTATTT GACTGAACAT CACTACATCC CTTCTGAAAC ACCTGATACC
ACTCAGACTC CCACTGTCCA GACAGATCCT GACGCAGGAC AAAAAACCGT TGCCGCTACA
GGTGATGTAC AGACAACCGC CCGTTATCAG AGCATGATCA ACGCCCGACA GTCTGCGGTA
ACTGATGCCC AGCAAACGCA AATTACAGAG CAACAGGCGC AGATCGTAGC CACACAAAAA
ACGCTCGCCG CGACTGGAGA TACGCAAAAT ACCGCGCATT ATCAGGAGAT GATTAATGCC
AGACTGGCGG CTCAAAATGA GGCTAATCAG CGCACTACCA CGGAACAAGG GCAGAAAATG
AATGCACTGA CAACCGATGT GGCAGCACAA CAGCAAAAAG AAAGGGCTCA ATACGATAAA
CAAATGCAAA GTCTGGCGCA GAAGTCTGTC CAGGCACATG AGCAAATTGA AAGTCTGAGA
CAAGATTCCG CACAAACGCA GCAACAGTTA ACCAACACGC AAAAACGGGT CGCAGATAAC
AGCCAACAAA TTAACACGCT CAATAACCAT TTCGATTCTC TGAAAAACGA AGTTGAGGAC
AATCGTAAAG AAGCCAATGC GGGAACTGCA TCTGCCATTG CTATCGCCTC ACAACCACAG
GTGAAAACCG GTGACGTGAT GATGGTGTCA GCGGGAGCGG GAACGTTCAA CGGTGAATCT
GCGGTGTCTG TAGGAACATC TTTTAATGCC GGAACGCATA CGGTACTTAA AGCAGGTATT
TCTGCGGATA CACAATCTGA TTTCGGTGCG GGTGTCGGCG TGGGATATTC GTTCTAA
 
Protein sequence
MKTVNVALLA LIISAISSPV VLAGDTIEAA ATELSAINSG MSQSEIEQKI TRFLERTDNS 
PAAYTYLTEH HYIPSETPDT TQTPTVQTDP DAGQKTVAAT GDVQTTARYQ SMINARQSAV
TDAQQTQITE QQAQIVATQK TLAATGDTQN TAHYQEMINA RLAAQNEANQ RTTTEQGQKM
NALTTDVAAQ QQKERAQYDK QMQSLAQKSV QAHEQIESLR QDSAQTQQQL TNTQKRVADN
SQQINTLNNH FDSLKNEVED NRKEANAGTA SAIAIASQPQ VKTGDVMMVS AGAGTFNGES
AVSVGTSFNA GTHTVLKAGI SADTQSDFGA GVGVGYSF
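
Both sequences are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons, not in which amino acid each codon encodes). The function names are hypothetical, and the whole block is an illustration rather than the database's own method.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "ATGAAAACTG TAAACGTAGC TTTACTGGCA CTCATAATTT CAGCAATATC CAGCCCTGTT"
print(translate(first_line))  # MKTVNVALLALIISAISSPV
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to check against the protein sequence and the GC content field of the record.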