Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2216 |
Symbol | |
ID | 6146396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2232062 |
End bp | 2233822 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617092 |
Product | hypothetical protein |
Protein accession | YP_001744266 |
Protein GI | 170683951 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.522723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.321049 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAA CATTTATCCC CGGCAAAGAT GCCGCACTGG AAGATTCCAT CGCTCGCTTC CAGCAAAAAC TTTCAGACCT CGGCTTTCAG ATTGAAGAGG CCTCCTGGCT GAATCCCGTG CCTAACGTCT GGTCTGTACA TATTCGCGAC AAAGAGTGCG CACTGTGTTT TACCAACGGT AAAGGCGCAA CCAAAAAAGC GGCGCTGGCT TCTGCACTCG GTGAATATTT CGAGCGTCTC TCAACCAACT ACTTTTTTGC GGACTTCTGG CTGGGCGAAA CCATCGCCAA CGGCCCATTC GTGCATTATC CCAACGAAAA ATGGTTCCCA CTGACCGAAA ATGACGATGT ACCAGAAGGG CTGCTCGATG ACCGTCTGCG CGCATTTTAC GATCCGGAGA ATGAACTGAC CGGTAGTATG CTGATTGACC TGCAATCCGG TAACGAAGAT CGTGGTATTT GTGGTCTACC GTTTACGCGC CAGTCCGACA ACCAGACCAT TTATATTCCG ATGAATATCA TTGGTAACTT GTACGTTTCT AACGGTATGT CTGCTGGCAA TACCCGTAAC GAAGCACGCG TTCAGGGATT GTCCGAAGTT TTCGAACGCT ACGTGAAAAA CCGCATTATT GCTGAAAGCA TCAGCTTGCC GGAAATCCCG GCAGACGTGC TGGCGCGTTA CCCAGCGGTG GTTGAAGCGA TCGAAACACT GGAAGCAGAA GGTTTCCCAA TCTTTGCTTA TGATGGTTCG CTTGGCGGCC AGTATCCGGT GATTTGCGTG GTTCTGTTTA ATCCTGCTAA CGGCACCTGC TTTGCCTCTT TCGGTGCGCA TCCTGATTTT GGCGTAGCAC TGGAACGTAC CGTGACCGAG CTGCTGCAAG GTCGTGGCCT GAAAGATTTG GATGTGTTTA CTCCGCCAAC CTTCGATGAT GAAGAAGTTG CTGAACATAC CAACCTCGAA ACGCACTTTA TCGATTCCAG CGGTTTAATC TCCTGGGACC TGTTCAAGCA GGATGCCGAT TATCCGTTTG TGGACTGGAA TTTCTCCGGC ACTACGGAAG AAGAGTTTGC TACGCTGATG GCTATCTTCA ACAAAGAAGA TAAAGAAGTT TATATTGCCG ATTACGAGCA TCTGGGCGTT TATGCTTGCC GTATTATCGT GCCTGGCATG TCCGATATTT ATCCGGCTGA AGATCTGTGG TTAGCGAATA ACAGTATGGG TAGCCATTTA CGTGAAACCA TTCTCTCGCT ACCAGGCAGT GTGTGGGAAA AAGAAGATTA CCTGAACCTC ATCGAGCAAC TGGATGAAGA AGGTTTTGAT GACTTTACTC GCGTACGTGA ACTACTGGGT CTGGCGACCG GACCGGATAA CGGTTGGTAC ACCCTGCGTA TCGGTGAATT AAAAGCCATG CTGGCGCTGG CTGGTGGCGA TCTGGAACAG GCTCTGGTCT GGACTGAATG GACGATGGAA TTTAACTCAT CAGTGTTCAG TCCGGAACGC GCCAACTATT ATCGCTGTCT GCAAACGTTG TTATTACTGG CACAGGAAGA AGACCGCCAA CCGCTGCAAT ATCTGAATGC CTTCGTTCGT ATGTATGGCG CAGACGCCGT GGAAGCCGCC AGTGCGGCAA TGAGCGGCGA AGCGGCGTTT TATGGCTTGC AACCAGTAGA TAGCGATCTG CACGCGTTTG CTGCACATCA ATCGTTGCTG AAAGCCTACG AAAAGCTGCA GCGCGCCAAA GCGGCATTCT GGGCAAAATA A
|
Protein sequence | MTQTFIPGKD AALEDSIARF QQKLSDLGFQ IEEASWLNPV PNVWSVHIRD KECALCFTNG KGATKKAALA SALGEYFERL STNYFFADFW LGETIANGPF VHYPNEKWFP LTENDDVPEG LLDDRLRAFY DPENELTGSM LIDLQSGNED RGICGLPFTR QSDNQTIYIP MNIIGNLYVS NGMSAGNTRN EARVQGLSEV FERYVKNRII AESISLPEIP ADVLARYPAV VEAIETLEAE GFPIFAYDGS LGGQYPVICV VLFNPANGTC FASFGAHPDF GVALERTVTE LLQGRGLKDL DVFTPPTFDD EEVAEHTNLE THFIDSSGLI SWDLFKQDAD YPFVDWNFSG TTEEEFATLM AIFNKEDKEV YIADYEHLGV YACRIIVPGM SDIYPAEDLW LANNSMGSHL RETILSLPGS VWEKEDYLNL IEQLDEEGFD DFTRVRELLG LATGPDNGWY TLRIGELKAM LALAGGDLEQ ALVWTEWTME FNSSVFSPER ANYYRCLQTL LLLAQEEDRQ PLQYLNAFVR MYGADAVEAA SAAMSGEAAF YGLQPVDSDL HAFAAHQSLL KAYEKLQRAK AAFWAK
|
| |