Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2151 |
Symbol | |
ID | 6143058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2157613 |
End bp | 2158803 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617027 |
Product | hypothetical protein |
Protein accession | YP_001744201 |
Protein GI | 170680438 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.111749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.201335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG TGGATCTTTT CCGGGGCCGT TGCCCGCATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTACC CGACGTTTAC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCAAC AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGCGCAG AGTATCAGCG CGCGGCATTA ATTAGTGCCC TGCAAACGCT GTACCCGGAA TGTGCGATTT ACGATCGCAG CGATGTTGCG GTACGTAAAA AAGAAGGGAT GGAGCTGACC CAGGGCCCCG TCACCGGCGA GTTGCCGCCT GCCCTGCTGC CGATTGAAGA ACACGGCATG AAGCTGCTGG TGGACATACA GCACGGACAC AAAACGGGCT ACTACCTGGA TCAGCGTGAT AGCCGCCTGG CTACCCGCCG CTACGTTGAA AATAAACGCG TACTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCCCTGGA TATTGCACGG CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC TTTAAATTGC TGCGTACCTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGTGG CTATAAAGAC ATCAACATGC TGGCGATTCA GTTGCTGAAT GAAGGCGGAG TTCTCCTGAC TTTCTCCTGT TCTGGCCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CAGATGCCGC AATTGATGCC GGTCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCAG CCGATCATCC GGTGATCGCT ACCTATCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
|
Protein sequence | MSVRLVLAKG REKSLLRRHP WIFSGAVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ IRARVWTFDP SESIDIAFFT RRLQQAQKWR DWLAQKDGLN SYRLIAGESD GLPGITIDRF GNFLVLQLLS AGAEYQRAAL ISALQTLYPE CAIYDRSDVA VRKKEGMELT QGPVTGELPP ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGVLLTFSC SGLMTSDLFQ KIIADAAIDA GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM
|
| |