Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0533 |
Symbol | |
ID | 6144759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 542796 |
End bp | 543812 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641615427 |
Product | hypothetical protein |
Protein accession | YP_001742634 |
Protein GI | 170684227 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACTG TAAACGTAGC TTTACTGGCA CTCATAATTT CAGCAATATC CAGCCCTGTT GTTTTAGCTG GTGATACCAT TGAAGCGGCG GCAACAGAGC TTTCAGCCAT TAACTCTGGC ATGTCGCAAT CGGAGATTGA GCAGAAGATT ACCCGCTTTT TAGAACGCAC AGACAACAGC CCCGCTGCGT ATACCTATTT GACTGAACAT CACTACATCC CTTCTGAAAC ACCTGATACC ACTCAGACTC CCACTGTCCA GACAGATCCT GACGCAGGAC AAAAAACCGT TGCCGCTACA GGTGATGTAC AGACAACCGC CCGTTATCAG AGCATGATCA ACGCCCGACA GTCTGCGGTA ACTGATGCCC AGCAAACGCA AATTACAGAG CAACAGGCGC AGATCGTAGC CACACAAAAA ACGCTCGCCG CGACTGGAGA TACGCAAAAT ACCGCGCATT ATCAGGAGAT GATTAATGCC AGACTGGCGG CTCAAAATGA GGCTAATCAG CGCACTACCA CGGAACAAGG GCAGAAAATG AATGCACTGA CAACCGATGT GGCAGCACAA CAGCAAAAAG AAAGGGCTCA ATACGATAAA CAAATGCAAA GTCTGGCGCA GAAGTCTGTC CAGGCACATG AGCAAATTGA AAGTCTGAGA CAAGATTCCG CACAAACGCA GCAACAGTTA ACCAACACGC AAAAACGGGT CGCAGATAAC AGCCAACAAA TTAACACGCT CAATAACCAT TTCGATTCTC TGAAAAACGA AGTTGAGGAC AATCGTAAAG AAGCCAATGC GGGAACTGCA TCTGCCATTG CTATCGCCTC ACAACCACAG GTGAAAACCG GTGACGTGAT GATGGTGTCA GCGGGAGCGG GAACGTTCAA CGGTGAATCT GCGGTGTCTG TAGGAACATC TTTTAATGCC GGAACGCATA CGGTACTTAA AGCAGGTATT TCTGCGGATA CACAATCTGA TTTCGGTGCG GGTGTCGGCG TGGGATATTC GTTCTAA
|
Protein sequence | MKTVNVALLA LIISAISSPV VLAGDTIEAA ATELSAINSG MSQSEIEQKI TRFLERTDNS PAAYTYLTEH HYIPSETPDT TQTPTVQTDP DAGQKTVAAT GDVQTTARYQ SMINARQSAV TDAQQTQITE QQAQIVATQK TLAATGDTQN TAHYQEMINA RLAAQNEANQ RTTTEQGQKM NALTTDVAAQ QQKERAQYDK QMQSLAQKSV QAHEQIESLR QDSAQTQQQL TNTQKRVADN SQQINTLNNH FDSLKNEVED NRKEANAGTA SAIAIASQPQ VKTGDVMMVS AGAGTFNGES AVSVGTSFNA GTHTVLKAGI SADTQSDFGA GVGVGYSF
|
| |