Gene Information |
Locus tag | EcSMS35_4883 |
Symbol | |
ID | 6144391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4999070 |
End bp | 5000350 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619687 |
Product | hypothetical protein |
Protein accession | YP_001746794 |
Protein GI | 170681238 |
COG category | [S] Function unknown |
COG ID | [COG2733] Predicted membrane protein |
TIGRFAM ID | |
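The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the single stop codon of this non-spliced CDS. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4999070
end_bp = 5000350

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one trailing stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1281 426
```

Both results match the Gene Length (1281 bp) and Protein Length (426 aa) fields listed above.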
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.82212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.899072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG CTATTTCGCC GCGTGCCGAT TCCGATCATT TCTCGTCATA CGGCGATTAT CCCGCGTAAT AAAGACCGGA TTGGCGAAAA TCTCGGTCAG TTCGTGCAGG AAAAATTTCT CGATACCCAA TCGCTGGTGG CATTGATTCG ACGTCACGAA CCGGCGTTGT TGATTGGCAA CTGGTTTAGT CAGCCAGAAA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG TGGTTTTCTT GAACTGACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCAGTCCA TCGGGCGATT GATAAGGTCG ATCTTTCCGG CACCAGTGCG TTGATGCTGG AGAGTATGAC CAAAAACGAT CGTCATCAGG TGCTGCTGGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT AAATCGCGCA AGTTTATCGC CCAGCAGATT GTTCGCTGGC TGGAGAGTGA GCATCCACTG AAAGCCAAAA TTTTGCCCAC TGAATGGCTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC GCGGTGAATT CTTTGCTTGA TGATATTAGT CGCGATCGTG CGCATCAGAT CCGCCATGCG TTTGATCGCG CCACCTTCGC CCTGATCGAC AAGCTGAAAA ACGATCCGGA AATGGCAGCG CGAGCCGATG CCGTAAAAAG CTATCTGAAA GAAGATGAAG CTTTTAATCG CTATCTCAGT GAATTGTGGG GGGATTTACG GGAATGGCTG AAAGTGGATA TCAACAGTGA AGATTCTCGT GTGAAAGAAC GTATCGCACG AGCGGGTCAA TGGTTTGGTG AAACGTTAAT TGCCGATGAT GCCTTGCGGG CGTCGTTAAA TGGTCACCTG GAACAAGCCG CGCACCGCGT CGCGCCTGAG TTTTCCGCAT TCCTGACGCG CCACATCAGC GATACAGTAA AAAGCTGGGA TGCGCGGGAT ATGTCGCGGC AAATCGAGTT AAATATTGGC AAAGATCTGC AGTTTATCCG TGTCAACGGT ACGCTAGTTG GCGGTTGTAT TGGGCTAATT TTGTATTTGC TGTCGCAGCT CCCGGCCTTG TTCCCCCTCG GCAATTTTTA G
|
Protein sequence | MNKLIELRRA KMLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PALLIGNWFS QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD AVNSLLDDIS RDRAHQIRHA FDRATFALID KLKNDPEMAA RADAVKSYLK EDEAFNRYLS ELWGDLREWL KVDINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRVAPE FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL FPLGNF
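The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. As a minimal sketch, the hypothetical helper below (gc_percent is not part of the record or of any library) strips the whitespace and reports the G+C percentage, which could be compared against the GC content field when run on the full gene sequence.

```python
def gc_percent(seq: str) -> float:
    """Return the G+C percentage of a nucleotide string, ignoring whitespace."""
    # Join the space-separated blocks and normalise case.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Demo on the first three blocks of the gene sequence only;
# paste the full Gene sequence field to check the whole gene.
print(round(gc_percent("ATGAATAAAC TCATTGAACT CAGACGCGCC"), 1))
```

The record reports GC content as a whole percentage, so the value for the full sequence would need to be rounded before comparing.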