Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2276 |
Symbol | |
ID | 6147043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2299911 |
End bp | 2301476 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641617150 |
Product | hypothetical protein |
Protein accession | YP_001744323 |
Protein GI | 170682431 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00656399 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.253648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAT TATCCCCCCC TGAACTTGAC ATAACTAAAC TGCCTGACAG GTGCCAGGCA TTACTTGATG AAATGCATGA AGAAACGGGA ATAAGCCGTG AAATACTGCT GTCTGTTATG CTGACGGTCA AGGCTGTCTC GGTCCAGGAT ACGCATGAAG TTGAACTTTC TGGAGGGCAG CGTACCAGTC TTCAGATATA TATGTGTCTT TCATCAGCAT CAGGCAGTGG AAAAACCTCT GCCTGCGCAA AATTAATCGC CCCTGTTCAT GAAACAGAAG AAGAACTACA TCAGGCATAT ATCGACGATA AAAAAAACTA TGATCGCATG ATGGAAATGT GGACAACAGA TAAAAAAATC CTGGAGCGGA GATATAAAAA GGAAATGGAA AGATCCCCGG AAAATGCTGC TGCTGCACGA GCAGCGCTTG AAAAGTGTAT AGCAAATAAA CCCGTACCTC CCGTACAACA GGTTCTCATC GTGAATGACG CGACACCAGA AGGGATAGCT TTGAAACTCA GCCAGTCACC TTCTCTTCTG TTGCTGTCTG ATGAAGGAGG AACTATTCTG GATAAGCGTT TCGAACGCAA ATCGGCGCTG TACAACACAT TATGGAGTGG GCAACCAGTC ACCGTAGAAC GGGCATCCAG ACCCGGGTTC CGGGTAAAGG ATTCCCGTCT CACTATGCTT ATACTGACTC AACCGGTAAT ATTTGATAAG TTTTTTACTC TCACTGGCGA CCAGATCCGG GGCAATGGCT TTCTGGCCCG GGTGCTATTT TGCGAACCCG GAGCAAATAA AATAATGACG ACAGAACCGC ATACTGCCAC TCCTGTTGTA ACGCAGCAAT GTGCAACCTG TTTTCATGAG AAAAGTTTTG GCAGTCAGAT CAGAGACTCT CTGAGAGCGT CCCGGGAAAG GCGTGCAAAA GGTGAGCAAC GCATCCGCAT GACATTATCA CACGCAGCAT CCCACGATCT GGAAATCTTT CACGAAGAAA ATATGAGTGC TGTCAGGCAG AACCCACGTA TGACCACTTT CGAAGACATT ATCGTCAGGA AAAGAGAACA GGCTGTTCGA ATAGCCGCAC TTCTGGAGCT GGAGAACAAT CCACATGGTA CAGTAATAAC ACGAGAAAGT ATCAATAGTG CTATTTATCT CGTTGATTTT TATTTTCAGC ATCTTATATC CAAACTGGAA TCACTTCGGG AAATATCTCC TGCAGAAAAA CTGGATAAAT GGCTCAGAGA AAATATCATC CGGGTAAAAG GATACGAATA TCAAAAAAGC CATATACTAC AATATGGTCC ATATGCTCTC AGGAATAAAT GTGTTCTTGA TGAAGCACTT GATATACTTG CAGAACAGAA AAAAATCGTA ATTGATTATT CAATCGGCCA AAAAATCATT TATATCGGTG ACGCGATTAC ACCTTGTGAA TTAGCAAATG AGGCAAACAT CCCGATAATG GAGCGTGGAA TGTTTATAGT CTGTTGGGAC CATAAGCTGA ATAAGTACAG AAATGAAGAA CAGACTAAAA TATGGAACAT AACTACAAAA AATTAA
|
Protein sequence | MTKLSPPELD ITKLPDRCQA LLDEMHEETG ISREILLSVM LTVKAVSVQD THEVELSGGQ RTSLQIYMCL SSASGSGKTS ACAKLIAPVH ETEEELHQAY IDDKKNYDRM MEMWTTDKKI LERRYKKEME RSPENAAAAR AALEKCIANK PVPPVQQVLI VNDATPEGIA LKLSQSPSLL LLSDEGGTIL DKRFERKSAL YNTLWSGQPV TVERASRPGF RVKDSRLTML ILTQPVIFDK FFTLTGDQIR GNGFLARVLF CEPGANKIMT TEPHTATPVV TQQCATCFHE KSFGSQIRDS LRASRERRAK GEQRIRMTLS HAASHDLEIF HEENMSAVRQ NPRMTTFEDI IVRKREQAVR IAALLELENN PHGTVITRES INSAIYLVDF YFQHLISKLE SLREISPAEK LDKWLRENII RVKGYEYQKS HILQYGPYAL RNKCVLDEAL DILAEQKKIV IDYSIGQKII YIGDAITPCE LANEANIPIM ERGMFIVCWD HKLNKYRNEE QTKIWNITTK N
|
| |