Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1585 |
Symbol | |
ID | 6146897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1571129 |
End bp | 1572637 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616462 |
Product | hypothetical protein |
Protein accession | YP_001743640 |
Protein GI | 170682142 |
COG category | [S] Function unknown |
COG ID | [COG5339] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC GGCGCATGGT ATACAGGCAA GAAGATTGAA ACCCATCTCG AAGACATGGT CGCGCAGGCG AACGCGCAAC TCAAACTGAC CGCTCCTGAA TCCAACCTGG AAGTGAGTTA TCAAAACTAT CATCGCGGCG TATTCAGCAG TCAGCTGCAA CTGTTGGTGA AACCCATTGC CGGGAAAGAA AATCCGTGGA TTAAAAGCGG TCAGAGCGTC ATCTTCAACG AATCGGTTGA TCATGGTCCC TTCCCCCTTG CCCAGCTAAA AAAACTGAAC CTGCTCCCGT CGATGGCATC AATTCAAACC ACGCTGGTTA ATAACGAAGT TAGCAAGCCA CTGTTTGATA TGGCAAAAGG TGAAACGCCT TTTGAGATTA ACTCCCGCAT TGGTTACAGC GGAGATTCCA GTTCCGATAT TTCGCTCAAG CCGCTGAATT ACGAGCAAAA GGATGAAAAA GTCGCCTTTA GCGGCGGCGA GTTCCAGTTA AATGCGGACA GAGACGGCAA AGCTATCTCC CTTTCCGGAG AGGCGCAAAG TGGTCGGATA GACGCAGTTA ACGAATACAA CCAGAAAGTG CAGTTGACCT TTAATAATCT GAAAACCGAC GGTTCCAGCA CGCTGGCAAG TTTTGGTGAG CGCGTAGGAA ACCAAAAACT GTCACTGGAA AAAATGACCA TTTCAGTGGA AGGCAAAGAA CTGGCACTGC TGGAAGGCAT GGAGATCAGC GGTAAATCGG ATCTGGTCAA TGACGGTAAA ACGATCAATA GCCAACTGGA TTACTCGCTA AACAGCCTGA AGGTACAGAA TCAGGATCTG GGCAGTGGCA AGCTGACTTT AAAAGTCGGC CAGATTGATG GTGAAGCCTG GCATCAGTTT AGCCAGCAAT ATAACGCGCA AACTCAGGCG CTGCTGGCAC AGCCAGAAAT TGCCAACAAT CCCGAACTTT ATCAGGAGAA AGTGACGGAA GCCTTCTTTA GCGCCCTGCC GCTGATGTTG AAAGGCGATC CGGTGATTAC TATCGCGCCG CTAAGCTGGA AAAACAGTCA GGGTGAAAGT GCGCTGAATC TGTCGCTGTT CCTGAAAGAT CCGGCAACGA CTAAAGAAGC GCCGCAAACG CTGACGCAGG AAGTAGATCG TTCGGTTAAA TCTCTGGATG CGAAACTGAC CATTCCGGTG GATATGGCAA CTGAGTTGAT GACTCAGGTA GCGAAGCTGG AAGGTTATCA GGAAGATCAA GCGAAAAAAC TGGCGAAACA GCAAGTTGAA GGTGCATCAG CAATGGGGCA GATGTTCCGT CTGACCACCT TGCAGGACAA TACCATCACC ACCAGCCTGC AATATGCTAA CGGTCAGATA ACGTTAAACG GGCAGAAAAT GCCACTGGAA GATTTCGTGG GTATGTTTGC GATGCCGACA TTAAACGTTC CGGCTGTACC CGCTATTCCG CAGCAGTAA
|
Protein sequence | MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LLPSMASIQT TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLE KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITIAP LSWKNSQGES ALNLSLFLKD PATTKEAPQT LTQEVDRSVK SLDAKLTIPV DMATELMTQV AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYANGQI TLNGQKMPLE DFVGMFAMPT LNVPAVPAIP QQ
|
| |