Gene Information |
Locus tag | EcSMS35_1800 |
Symbol | |
ID | 6145955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1819348 |
End bp | 1820409 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641616676 |
Product | hypothetical protein |
Protein accession | YP_001743854 |
Protein GI | 170681259 |
COG category | [S] Function unknown |
COG ID | [COG3768] Predicted membrane protein |
TIGRFAM ID | [TIGR01620] conserved hypothetical protein, TIGR01620 |
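The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1819348
end_bp = 1820409
gene_length_bp = 1062
protein_length_aa = 353


def span_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1820409 - 1819348 + 1 = 1062 bp, and 1062 / 3 = 354 codons, one of which is the stop codon, leaving 353 residues.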
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.824337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC CGTTAAAACC ACGTATTGAT TTCGACGGTC CGCTGGATGT CGATCAGAAT CCTGAATTCA GGGCGCAGCA GACCTTTGAC GAAAATCAGG CACAAAATTT TGCCCCGGCC ACACTCGACG AAGCGCCTGA AGAAGAGGGG CAAGTTGAAG CGGTAATGGA TGCAGCGTTA CGTCCTAAAC GCAGCCTGTG GCGCAAAATG GTGATGGGCG GGCTGGCTCT GTTTGGCGCA AGCGTTGTCG GGCAGGGTGT ACAGTGGACA ATGAATGCCT GGCAAACCCA GGACTGGGTA GCGCTGGGTG GATGTGCTGC AGGGGCATTG ATTATCGGCG CTGGCGTAGG TTCTGTGGTA ACAGAGTGGC GGCGCTTATG GCGCTTGCGA CAGCGCGCCC ATGAACGCGA CGAAGCGCGT GATTTATTGC ACAGCCACGG CACCGGCAAA GGCCGCGCAT TTTGCGAAAA ACTGGCACAG CAGGCGGGCA TCGATCAGTC ACATCCGGCG CTGCAACGCT GGTATGCCTC AATCCATGAA ACGCAAAACG ACCGTGAAGT GGTCAGTTTG TATGCGCATT TGGTCCAGCC AGTTTTAGAT GCCCAGGCGC GGCGTGAAAT CAGCCGTTCG GCAGCGGAAT CAACATTGAT GATTGCGGTC AGCCCGCTGG CGCTGGTGGA TATGGCATTT ATCGCCTGGC GCAATCTGCG TTTGATTAAT CGCATCGCCA CGCTGTATGG CATTGAACTG GGGTATTACA GCCGTTTGCG CCTGTTTAAG CTGGTATTGC TGAATATCGC TTTCGCCGGA GCCAGCGAAC TGGTGCGCGA AGTGGGGATG GACTGGATGT CGCAAGATCT CGCTGCTCGT TTGTCTACCC GCGCAGCTCA GGGGATTGGT GCAGGACTTC TGACGGCACG ACTGGGGATT AAAGCTATGG AGCTTTGCCG CCCGCTGCCG TGGCTTGACG ATGACAAGCC ACGCCTCGGG GATTTCCGTC GTCAGCTTAT CGGTCAGGTG AAAGAAACTC TGCAAAAAGG CAAAACGCCC AGCGAAAAAT AA
Protein sequence | MTEPLKPRID FDGPLDVDQN PEFRAQQTFD ENQAQNFAPA TLDEAPEEEG QVEAVMDAAL RPKRSLWRKM VMGGLALFGA SVVGQGVQWT MNAWQTQDWV ALGGCAAGAL IIGAGVGSVV TEWRRLWRLR QRAHERDEAR DLLHSHGTGK GRAFCEKLAQ QAGIDQSHPA LQRWYASIHE TQNDREVVSL YAHLVQPVLD AQARREISRS AAESTLMIAV SPLALVDMAF IAWRNLRLIN RIATLYGIEL GYYSRLRLFK LVLLNIAFAG ASELVREVGM DWMSQDLAAR LSTRAAQGIG AGLLTARLGI KAMELCRPLP WLDDDKPRLG DFRRQLIGQV KETLQKGKTP SEK
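The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for elongation. It does not handle alternative start codons, which table 11 permits but this gene does not use. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces that separate the 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
first_30 = "ATGACCGAAC CGTTAAAACC ACGTATTGAT"
print(translate(first_30))  # MTEPLKPRID
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the protein sequence and GC content fields to be checked in the same way.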