Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1518 |
Symbol | |
ID | 6144678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1504139 |
End bp | 1505143 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616396 |
Product | LysM/ErfK/YbiS/YcfS/YnhG family protein |
Protein accession | YP_001743576 |
Protein GI | 170682137 |
COG category | [S] Function unknown |
COG ID | [COG1376] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000462996 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00000000671919 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACGCG CGTCTTTGCT TACACTCACG CTTATCGGCG CTTTTAGCGC CATCCAGGCT GCCTGGGCGG TTGATTATCC GCTACCACCA ACCGGAAGCC GACTGGTTGG GCAAAATCAA ACGTATACGG TGCAAGAAGG GGATAAAAAC CTTCAGGCCA TCGCCCGACG TTTTGATACT GCGGCAATGT TGATCCTTGA AGCCAATAAC ACTATCGCCC CGGTGCCAAA ACCTGGTACG ACGATAACTA TTCCTTCGCA ACTGTTATTA CCTGATGCGC CGCGTCAGGG GATTATCGTT AACCTGGCAG AGCTGCGCCT TTATTATTAT CCGCCGGGAG AAAATATCGT GCAGGTCTAT CCGATAGGTA TTGGTTTGCA GGGGCTGGAA ACGCCAGTGA TGGAAACGCG TGTGGGGCAG AAAATCCCTA ACCCAACCTG GACGCCTACG GCGGGCATTC GTCAGCGTTC GTTGGAGCGT GGCATTAAAT TACCGCCAGT CGTTCCTGCC GGGCCAAATA ACCCACTAGG ACGTTACGCA CTGCGCCTCG CGCATGGTAA TGGCGAATAC CTCATTCATG GCACCAGTGC GCCGGACAGC GTCGGTTTGC GCGTCAGTTC GGGCTGTATT CGCATGAATG CGCCGGATAT TAAAGCCTTA TTCTCCAGCG TGCGGACGGG AACGCCGGTG AAAGTGATCA ACGAACCGGT GAAATATTCC GTGGAACCGA ACGGGATGCG TTATGTTGAA GTACATCGAC CACTATCAGC AGAAGAACAG CAGAACGTTC AGACAATGCC ATACACGCTG CCAGCAGGCT TTACGCAATT TAAAGACAAT AAGGCTGTAG ATCAGAAGTT AGTCGATAAA GCGTTGTATC GTCGGGCAGG GTATCCGGTT GCGGTGAGCA GTGGAGCAAC TCCCGCAGCC AGCAATACGC CTTCAGTAGA GTCAGCGCAG AATGGTGAAC CAGAGCAAGG GAATATGTTA CGCGCGACGC AGTAG
|
Protein sequence | MKRASLLTLT LIGAFSAIQA AWAVDYPLPP TGSRLVGQNQ TYTVQEGDKN LQAIARRFDT AAMLILEANN TIAPVPKPGT TITIPSQLLL PDAPRQGIIV NLAELRLYYY PPGENIVQVY PIGIGLQGLE TPVMETRVGQ KIPNPTWTPT AGIRQRSLER GIKLPPVVPA GPNNPLGRYA LRLAHGNGEY LIHGTSAPDS VGLRVSSGCI RMNAPDIKAL FSSVRTGTPV KVINEPVKYS VEPNGMRYVE VHRPLSAEEQ QNVQTMPYTL PAGFTQFKDN KAVDQKLVDK ALYRRAGYPV AVSSGATPAA SNTPSVESAQ NGEPEQGNML RATQ
|
| |