Gene Information |
Locus tag | EcSMS35_1840 |
Symbol | |
ID | 6145131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1862859 |
End bp | 1863986 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616716 |
Product | hypothetical protein |
Protein accession | YP_001743894 |
Protein GI | 170682969 |
COG category | [S] Function unknown |
COG ID | [COG4950] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
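The length fields in this table can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon; both assumptions agree with the values listed. The function names `gene_length` and `protein_length` are hypothetical helpers.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1862859, 1863986)
print(length_bp)                  # 1128
print(protein_length(length_bp))  # 375
```

Both results match the Gene Length (1128 bp) and Protein Length (375 aa) rows.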
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000000000203222 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACAAC GCCACATCAC TGGCAAAAGC CACTGGTATC ATGAAACGCA ATCCAGTACT GCGGAGTATG ACGTTCTGCC TCTGGTCCCG GAAGCCGCAA AGGTCAGCGA TCCCTTTCTG CTCGACGTGA TCCTTGATGA AGAAACGCTG GCACCCTTCC TTTCATGGCT GGTCCCTGCG CGCGTTCTTG TAGTAGAGTT GTTCCCTGAC CAGCTTACCG TGACCCGTTC ACAGACCTTC ACCGCTTATG AACGCTTGTC GACGGCCCTT ACGGTTGCTC AGGTTTGCGG CGTCCAACGG TTATGTAACT ACTATTCGGC GCGACTTACG CCGCTCCCCG GGCCTGATTC CTCCAGGGAA AGTAATCATC GGTTGGCACA AATCACGCAA TATGCCCGCC AACTGGCTAG CTCGCCTTCT ATTATCGACA ACCGATCGCG CCAGCATCTG AATGACGTTG GTCTTACTGC CTGTGACTGT GTGATCATTA ACCAAATCAT TGGTTTTATT GGCTTTCAGG CGCGGACAAT TGCGACATTT CAGGCTTATC TCGGGCATCC AGTACGCTGG TTACCCGGGC TGGAGATACA AAACTACGCC GACGCATCAC TGTTTGCTGA TGAATCATTA CGCTGGCGAA GCAGCTATGA GGTGGAAAAA CTCCCTGAAG AGCACACAAA AAGTTCAACT GCAGAACTTT GCCAACTGGC CAACACACTC TCTCTCCACC CTATTTCACT TTCCCTTCTC GAAAAGTTGT TAAACAGCAC ACGGGTCAAT ACACAGCCGG ATAATCAGCT TGCGGCGTTG TTATGCGCGC GGATAAATGG CAGTCCTGCT TGTTTTGCCG CCTGTATGAA TTCATCAAAT GAATATAAAA AAATCAGCCT CCTTCTGCGC AAGGGCGAAA ATGAAATTAA CCGATGGGCT GACCGTCATT CTGTTGAGCG CGCTACCGTT CAGGCGATAC AATGGCTGAC CCGAGCACCC GATCGCTTTA GCGCCGCCCA GTTCAGCCCA TTACTCGAAC ACGAAAAATC ATCAACGCAG ATTATTAATC TGCTGGTATG GAGCGGGCTG TGTGGCTGGA TAAATCGTTT AAAAATCGCG TTGGGTGAGA CATATTAA
|
Protein sequence | MEQRHITGKS HWYHETQSST AEYDVLPLVP EAAKVSDPFL LDVILDEETL APFLSWLVPA RVLVVELFPD QLTVTRSQTF TAYERLSTAL TVAQVCGVQR LCNYYSARLT PLPGPDSSRE SNHRLAQITQ YARQLASSPS IIDNRSRQHL NDVGLTACDC VIINQIIGFI GFQARTIATF QAYLGHPVRW LPGLEIQNYA DASLFADESL RWRSSYEVEK LPEEHTKSST AELCQLANTL SLHPISLSLL EKLLNSTRVN TQPDNQLAAL LCARINGSPA CFAACMNSSN EYKKISLLLR KGENEINRWA DRHSVERATV QAIQWLTRAP DRFSAAQFSP LLEHEKSSTQ IINLLVWSGL CGWINRLKIA LGETY
|
| |
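The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before analysis. The sketch below is an illustration only: it strips the spacing, computes a GC fraction, and translates a coding sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which this sketch does not handle). The names `clean`, `gc_content`, and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# assumed identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence listed above.
print(translate("ATGGAACAAC GCCACATCAC TGGCAAAAGC"))  # MEQRHITGKS
print(translate("TTGGGTGAGA CATATTAA"))               # LGETY
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content row.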