Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0803 |
Symbol | |
ID | 6146495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 807433 |
End bp | 808341 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615691 |
Product | hypothetical protein |
Protein accession | YP_001742883 |
Protein GI | 170683940 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000436595 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAATC GTACGCTGGC TGACCTTGAT CGTGTCGTTG CTCTCGGCGG AGGGCATGGA CTGGGACGCG TTCTCTCATC ACTTTCGTCT TTGGGTTCTC GTTTAACGGG TATCGTCACC ACCACGGATA ATGGTGGCTC GACGGGGCGT ATTCGCCGCT CAGAAGGCGG TATTGCCTGG GGCGATATGC GTAACTGCCT CAACCAGCTG ATAACTGAAC CCAGCGTCGC TTCCGCAATG TTTGAATATC GTTTCGGCGG TAATGGCGAA CTTTCCGGGC ACAACCTTGG AAACCTGATG CTGAAGGCAC TGGATCACCT TAGCGTGCGA CCTCTTGAAG CGATCAATCT GATTCGCAAT CTGCTGAAAG TGGACGCACA TTTGATTCCG ATGTCGGAGC ATCCTGTTGA TCTGATGGCG ATTGACGATC AGGGGCATGA AGTTTACGGC GAGGTCAATA TCGACCAGTT AACTGCGCCG ATTCAAGAGT TATTGTTAAC GCCGAATGTA CCCGCAACGC GTGAGGCGGT TCACGCTATC AGTGAAGCGG ATCTCATCAT TATTGGGCCT GGCAGTTTTT ACACCAGCCT GATGCCAATT CTGCTGCTGA ATGAAATCGC ACAAGCATTA CGCCGCACGC CAGCGCCGAT GGTCTATATC GGCAATCTGG GGCGTGAGTT GAGTTTACCT GCGGCTAATT TGAAGCTGGA AAGCAAGCTG GCAATTATGG AGCAGTATGT CGGTAAAAAA GTGATTGATG CGGTAGTTGT TGGGCCGAAA GCTGATGTAT CCGCAGTTAA CAACAGAATT GTGATTCAGG AAGTACTGGA GGCCAGCGAT ATCCCCTATC GTCATGACCG CCAGTTGTTA CATAACGCGC TGGAAAAGGC ATTACAGGCT TTAGGTTAA
|
Protein sequence | MRNRTLADLD RVVALGGGHG LGRVLSSLSS LGSRLTGIVT TTDNGGSTGR IRRSEGGIAW GDMRNCLNQL ITEPSVASAM FEYRFGGNGE LSGHNLGNLM LKALDHLSVR PLEAINLIRN LLKVDAHLIP MSEHPVDLMA IDDQGHEVYG EVNIDQLTAP IQELLLTPNV PATREAVHAI SEADLIIIGP GSFYTSLMPI LLLNEIAQAL RRTPAPMVYI GNLGRELSLP AANLKLESKL AIMEQYVGKK VIDAVVVGPK ADVSAVNNRI VIQEVLEASD IPYRHDRQLL HNALEKALQA LG
|
| |