Gene Information |
Locus tag | EcSMS35_4000 |
Symbol | |
ID | 6144966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4079665 |
End bp | 4081047 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641618825 |
Product | putative transporter |
Protein accession | YP_001745964 |
Protein GI | 170681221 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter |
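The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon (both assumptions, although they agree with the listed values). The function names `span_length` and `expected_protein_length` are illustrative and are not part of the source database.

```python
# Values copied from the record above.
start_bp = 4079665
end_bp = 4081047
gene_length_bp = 1383
protein_length_aa = 460


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```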
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.947516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGAGTG AAGTGTTGTC CGTTAAAGAG AAAATTGGTT ATGGCATGGG AGACGCCGCC AGCCACATTA TTTTCGATAA CGTAATGTTA TATATGATGT TCTTTTATAC CGATATTTTT GGCATTCCTG CCGGATTTGT CGGAACCATG TTTTTGGTCG CTCGTGCACT GGATGCGATT TCCGATCCTT GCATGGGGTT GTTGGCCGAT CGAACGCGCT CTCGCTGGGG TAAATTTCGT CCGTGGGTAC TGTTTGGCGC ACTGCCATTT GGGATCGTCT GTGTACTGGC CTATAGCACG CCAGATCTCA GTATGAACGG AAAAATGATT TATGCAGCAA TTACTTACAC CCTACTTACC TTACTTTATA CCATCGTTAA TATCCCTTAC TGCGCATTGG GTGGTGTAAT CACCAATGAC CCGACTCAGC GTATCTCGCT GCAATCCTGG CGTTTTGTGC TGGCGACTGC GGGAGGCATG CTCTCTACTG TTCTGATGAT GCCGCTGGTT AATTTAATTG GTGGTGATAA TAAACCACTC GGTTTCCAGG GCGGTATCGC GGTCCTTTCC GTGGTGGCAT TCATGATGCT GGCATTTTGT TTCTTCACCA CTAAAGAACG CGTTGAAGCA CCACCTACAA CGACGTCTAT GAGGGAAGAT TTACGTGATA TCTGGCAAAA CGACCAGTGG CGTATTGTCG GTTTACTAAC CATTTTCAAT ATTCTGGCGG TATGCGTACG CGGCGGGGCG ATGATGTATT ACGTCACATG GATTTTGGGT ACGCCGGAAG TGTTTGTCGC TTTTCTTACC ACTTATTGCG TGGGTAACCT GATTGGTTCC GCACTGGCGA AACCGCTGAC CGACTGGAAA TGTAAAGTCA CTATCTTCTG GTGGACGAAC GCCCTGCTGG CAGTGATTAG CCTCGCGATG TTCTTTGTTC CTATGCAGGC CAGCATCACC ATGTTTGTCT TCATCTTCGT GATTGGTGTG TTGCATCAAC TGGTGACACC TATCCAGTGG GTAATGATGT CCGATACCGT CGACTACGGC GAGTGGTGTA ATGGTAAACG CCTGACCGGG ATCAGTTTTG CTGGCACGCT GTTTGTGCTC AAACTGGGGT TGGCCTTCGG CGGCGCTCTT ATCGGCTGGA TGCTGGCTTA TGGCGGATAT GATGCGGCAG AAAAAGCGCA GAACAGCGCC ACGATTAGCA TCATTATCGC GCTATTCACG ATTGTTCCGG CGATCTGTTA TTTGCTGAGC GCGATTATCG CTAAACGCTA CTACTCACTC ACGACGCACA ATCTGAAAAC CGTTATGGAA CAGCTGGCCC AGGGCAAACG CCGTTGCCAG CAACAATTCA CCTCTCAAGA AGTGCAGAAC TAA
Protein sequence | MKSEVLSVKE KIGYGMGDAA SHIIFDNVML YMMFFYTDIF GIPAGFVGTM FLVARALDAI SDPCMGLLAD RTRSRWGKFR PWVLFGALPF GIVCVLAYST PDLSMNGKMI YAAITYTLLT LLYTIVNIPY CALGGVITND PTQRISLQSW RFVLATAGGM LSTVLMMPLV NLIGGDNKPL GFQGGIAVLS VVAFMMLAFC FFTTKERVEA PPTTTSMRED LRDIWQNDQW RIVGLLTIFN ILAVCVRGGA MMYYVTWILG TPEVFVAFLT TYCVGNLIGS ALAKPLTDWK CKVTIFWWTN ALLAVISLAM FFVPMQASIT MFVFIFVIGV LHQLVTPIQW VMMSDTVDYG EWCNGKRLTG ISFAGTLFVL KLGLAFGGAL IGWMLAYGGY DAAEKAQNSA TISIIIALFT IVPAICYLLS AIIAKRYYSL TTHNLKTVME QLAQGKRRCQ QQFTSQEVQN
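The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used as sample input, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record above.
first_blocks = "ATGAAGAGTG AAGTGTTGTC"
seq = clean_sequence(first_blocks)

print(len(seq), seq.startswith("ATG"), gc_fraction(seq))
```

Applying the same two functions to the full gene sequence string would allow a comparison against the listed gene length and GC content.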