Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2195 |
Symbol | |
ID | 6145177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2204363 |
End bp | 2206210 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617071 |
Product | hypothetical protein |
Protein accession | YP_001744245 |
Protein GI | 170683737 |
COG category | [S] Function unknown |
COG ID | [COG2989] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.555237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.324858 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCTTA ATATGATGTG TGGTCGTCGG CTGTCGGCAA TCAGTTTGTG CGTGGCCGTA ACATTCGCTC CACTGTTCAA TGCGCAGGCC GATGAGCCTG AAGTAATCCC TGGCGACAGC CCGGTGGCTG TCAGTGAACA GGGCGAGGCA CTGCCGCAGG CGCAAGCCAC GGCAATTATG GCGGGGATCC AGCCATTGCC TGAAGGTGCG GCAGAAAAAG CCCGCACGCA AATCGAATCT CAATTACCCG CAGGTTATAA GCCGGTTTAT CTTAACCAGC TTCAACTGTT GTATGCCGCA CGCGATATGC AACCCATGTG GGAAAACCGT GATGCTGTTA AAGCCTTCCA GCAACAGCTG GCAGAGGTGG CGATTGCCGG TTTCCAGCCG CAGTTTAATA AATGGGTAGA GTTACTGACC GATCCTGGTG TTAACGGGAT GGCACGCGAC GTCGTGCTCT CTGATGCGAT GATGGGCTAT CTCCATTTCA TTGCAAATAT TCCGGTCAAA GGCACTCGCT GGCTATATAG CAGTAAACCT TATGCGCTTT CAACGCCGCC GCTTTCGGTG ATTAACCAAT GGCAGCTGGC GCTGGATAAA GGTCAATTGC CGACGTTTGT TGCAGGTCTG GCACCGCAGC ATCCGCAATA TGCAGCGATG CATGAATCGT TACTGGCCTT ACTCAGTGAC ACCAAACCGT GGCCCCAACT GACCGGCAAA GCAACGTTGC GCCCAGGGCA GTGGAGTAAC GACGTACCGG CGTTGCGCGA AATATTGCAA CGCACAGGCA TGCTGGACGG GGGGCCGAAA ATTACTCTAC CTGGCGATGA CACGCCAACT GACGCGGTAG TCAGCCCATC CGCTGTTACT GTTGAAACAG CAGAAACTAA GCCGATGGAC AAGCAAACGA CGTCTCGTAG TAAACCTGCG CCTGCCGTTC GTGCCGCCTA CGATAATGAA CTGGTGGAAG CCGTTAAACG TTTTCAGGCA TGGCAAGGAT TGGGAGCAGA TGGCGCTATT GGCCCGGCAA CGCGTGACTG GTTAAACGTA ACGCCCGCCC AGCGTGCTGG CGTGTTGGCA CTCAACATCC AGCGATTGCG CTTGCTGCCA ACAGAGCTTT CTACCGGGAT CATGGTTAAC ATTCCGGCCT ATTCGCTGGT CTACTATCAG AACGGCAATC AGGTGCTGGA TTCGCGAGTC ATTGTCGGTC GCCCCGATCG CAAAACGCCG ATGATGAGCA GTGCACTTAA CAACGTAGTG GTAAACCCGC CGTGGAACGT ACCACCAACT CTGGCACGCA AAGATATTCT GCCGAAAGTG CGCAACGATC CGGGATATCT CGAAAGCCAT GGCTATACGG TGATGCGCGG CTGGAACAGC AGAGAAGCGA TTGACCCATG GCAGGTTGAC TGGTCTACAA TCACGGCCTC GAATTTACCG TTCCGCTTCC AGCAGGCTCC AGGCCCACGG AACTCGCTGG GGCGCTATAA ATTCAATATG CCGAGTTCAG AGGCCATTTA TTTGCATGAC ACGCCGAACC ACAATCTGTT CAAACGTGAT ACACGCGCAT TGAGCTCAGG CTGTGTGCGA GTGAATAAAG CTTCCGATCT GGCGAATATG CTGTTGCAGG ATGCAGGATG GAATGACAAA CGTATTTCTG ATGCGCTGAA GCAGGGCGAT ACGCGTTACG TCAATATTCG GCAGTCGATT CCGGTGAATC TCTACTACCT GACGGCCTTT GTTGGTGCAG ATGGTCGTAC CCAGTATCGT ACAGATATTT ACAATTATGA TCTGCCTGCG CGATCCAGCT CGCAAATCGT ATCGAAAGCG GAACAATTAA TCAGGTAA
|
Protein sequence | MLLNMMCGRR LSAISLCVAV TFAPLFNAQA DEPEVIPGDS PVAVSEQGEA LPQAQATAIM AGIQPLPEGA AEKARTQIES QLPAGYKPVY LNQLQLLYAA RDMQPMWENR DAVKAFQQQL AEVAIAGFQP QFNKWVELLT DPGVNGMARD VVLSDAMMGY LHFIANIPVK GTRWLYSSKP YALSTPPLSV INQWQLALDK GQLPTFVAGL APQHPQYAAM HESLLALLSD TKPWPQLTGK ATLRPGQWSN DVPALREILQ RTGMLDGGPK ITLPGDDTPT DAVVSPSAVT VETAETKPMD KQTTSRSKPA PAVRAAYDNE LVEAVKRFQA WQGLGADGAI GPATRDWLNV TPAQRAGVLA LNIQRLRLLP TELSTGIMVN IPAYSLVYYQ NGNQVLDSRV IVGRPDRKTP MMSSALNNVV VNPPWNVPPT LARKDILPKV RNDPGYLESH GYTVMRGWNS REAIDPWQVD WSTITASNLP FRFQQAPGPR NSLGRYKFNM PSSEAIYLHD TPNHNLFKRD TRALSSGCVR VNKASDLANM LLQDAGWNDK RISDALKQGD TRYVNIRQSI PVNLYYLTAF VGADGRTQYR TDIYNYDLPA RSSSQIVSKA EQLIR
|
| |