Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3060 |
Symbol | |
ID | 6144830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3149691 |
End bp | 3150734 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641617929 |
Product | solute-binding family 7 protein |
Protein accession | YP_001745080 |
Protein GI | 170679959 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.591808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.298604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTACC GACAATTACT CCCCTTTCTT TTATCAGGAT TTCTTCTGTT ATCTCCCTCG ATTTTTGCTG CTGAAAAAAT TCTTCGTTAT ACAGACCATG AACCCTACGG GGGAATGCGT ACACAAATAA TTAAAGAAAT ATTTTTTGCA GAAATTGAAA AAGAGTCGCA GGGGCGTTTG AAAATCGAAC CACACTGGAA CGGTGAAACC GCCATCAGCT ATGACGCTCT GACGACAATA AGCGATGGCA GCAAAGCTGA TATGGGTATA GTGGTGCCGG AATACACCGC GAAACAATTG CCGCTTCATC AAATCTTCAA GAGCTTTGCT ATTGGCCCGG ATCATGGAGC CAGTCAGGTA GAATTCTTTC GTCGCGTATA TGCGGAAATT CCCGAATTTA ACGCTGAACT TGAGCGTAAC AATATCGTGA ATTTACAGTT TTTCCTTGGC TACCCGGTAG GCTTTTTCTC TACCAGGCCC ATTGATAAAT TGACTGCGCT TCAGGGAACC ACCTGGCGAA CAGCCAGTTT CTGGCATCGG GCTTATTTAA CTCATACGGG GGCAAAAACC GTAACTTTAC CGTGGAATGA TCAAATAACT AAAGCACTCA TGGATGGAAA ACTGGATGGT TTAATGGTCA ATCTCGATAG CGGATATGAC ATCCATGCTG AACGTGCTGC GCCGAATGTG TTGCTCTCAC CTTCTCTCTG GCTTGGTCAT GTTTATCTGT TGGTAATGAA TAAACAATCG TGGGAAAACC TTGATAACAG AGATCGTGAG GCTATTCAAC GAGCTGCCAT TACAACCGAG AAAGCACTGG GCAAGGCATT AGATAACAAC CTGATCAGCA TGGTAAAAAC GCTTGAGCAG GAAGGTGCAC AGGTTCGCTA TCTGAAAAAA TCAGGGCTGG ACGCCTGGCA GAAAGCGATC GGTTATCAGC AAGAACAAGC ACAGTGGGTA GAAAAGCAAA ATAAGGAAGG CGTGGAGAAA GCCGGGGAAG TCATGCAAAA AGTTGCCAAT ATACTCGATG AAACAATGCG TTAA
|
Protein sequence | MHYRQLLPFL LSGFLLLSPS IFAAEKILRY TDHEPYGGMR TQIIKEIFFA EIEKESQGRL KIEPHWNGET AISYDALTTI SDGSKADMGI VVPEYTAKQL PLHQIFKSFA IGPDHGASQV EFFRRVYAEI PEFNAELERN NIVNLQFFLG YPVGFFSTRP IDKLTALQGT TWRTASFWHR AYLTHTGAKT VTLPWNDQIT KALMDGKLDG LMVNLDSGYD IHAERAAPNV LLSPSLWLGH VYLLVMNKQS WENLDNRDRE AIQRAAITTE KALGKALDNN LISMVKTLEQ EGAQVRYLKK SGLDAWQKAI GYQQEQAQWV EKQNKEGVEK AGEVMQKVAN ILDETMR
|
| |