Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1027 |
Symbol | |
ID | 6146594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1049057 |
End bp | 1050235 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641615914 |
Product | putative polysaccharide biosynthesis protein |
Protein accession | YP_001743106 |
Protein GI | 170682445 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAG AAATTTATAA ATATGGCACA ATAATATTAT GCTATGGAAT TAACTTTATA CTACCTATTC TTTTCTTTAA AGAAATAAGT TTAATATTAT CTCCAATAGT ATTATCATCA TTTTTCTTTG TTTTGGCATT GCTAATGTAT TGTCAATTCA TTGTTGAATA CGGGTTGCAA ATATCATTAT TAAGAAAGTT AAATGAAAAT AAGAGTGATT TAAGTCGGCT TTTATCTACT ATTCTTGGAC TAAAAGTAAT ACTATTCGTC CTATGCGCAG TATTCATTTA TTGTATTTTG CTATATAATA ATGAAGTTAT ATTTTTTATA CTTGTTTTCA TTCTGCTAGG AAACGTATTC TCATGCCAGT TCCTCTATCA AGTAGTTGAT CAGCTCCATT TTTTTTATGT ATTAAATTCG CTTGTTAAGC TAATTTTCAT ACCCCTCATT TTTATCAATG ATAATTATAT CTATCTGATG ATATGTTATA GTTGTTTCAA TATCCTACCT AACTTAGCTG CACTTGCTTA TTTTTTATAT AAGAATAAAG TTAAGGTTGT AAAAGTTCCT GCTTGTGAAA TTTATAATTT ATCTAAGGAA TGTTTTAGCT ATTTTATCTC GAATATTTCG ATAACATTAT ACACTAATTT TTATCAGATC TTATTAGGCA TTATATCGCC CGCTTATCTG GCCGCGTATG CATTAAGTGA CAAAATTATT AGGGGGATCG TGTCAGGGAA TTATGTGTTT ATACAGATAG TACAAGTACA ATACTTAAAT GATGAATCAA AGTTCATCAA TGGTACATGG AAAACGATTG TTTCATTATT ATTACTGGGG GGGATGGAGT TTTTATTTAT TTTCTTTGGA GCGGATTTTT TCTCTCGTTA TTTATTCCCT ACAATATCTT TACTGCCAAT TTTTCTAAAA CTAATGTCAT TGCTAATTGT TGTTATCTTG TTGAGTAACT TTTTTGCAAT GGTTTATCTT CCGTGTTCAG GCAATACTTT ATTCCTCTCA CGTGTTCTGA TTTTTGTTTC TATTGTGTCC GTAGTAGTTG CCCCGATACT AATATGGCTC TATGCAGGTT ATGGTGCAGT CATTTCCGCT ATATTTTCAG AGTTATTAGT TCTGATTTTA TGCTATGGTT TATATTTGAA ATTAAAACAC AAACTCTAA
|
Protein sequence | MKKEIYKYGT IILCYGINFI LPILFFKEIS LILSPIVLSS FFFVLALLMY CQFIVEYGLQ ISLLRKLNEN KSDLSRLLST ILGLKVILFV LCAVFIYCIL LYNNEVIFFI LVFILLGNVF SCQFLYQVVD QLHFFYVLNS LVKLIFIPLI FINDNYIYLM ICYSCFNILP NLAALAYFLY KNKVKVVKVP ACEIYNLSKE CFSYFISNIS ITLYTNFYQI LLGIISPAYL AAYALSDKII RGIVSGNYVF IQIVQVQYLN DESKFINGTW KTIVSLLLLG GMEFLFIFFG ADFFSRYLFP TISLLPIFLK LMSLLIVVIL LSNFFAMVYL PCSGNTLFLS RVLIFVSIVS VVVAPILIWL YAGYGAVISA IFSELLVLIL CYGLYLKLKH KL
|
| |