Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3519 |
Symbol | nanT1 |
ID | 6143258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3598009 |
End bp | 3599499 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618348 |
Product | putative sialic acid transporter |
Protein accession | YP_001745495 |
Protein GI | 170683321 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00891] putative sialic acid transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.434563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACTA CAACCCAGAA TATCCCGTGG TATCGCCATC TTAACCGCGC ACAATGGCGG GCATTTTCCG CTGCCTGGTT GGGATATCTG CTTGACGGTT TTGATTTCGT TTTAATCGCC CTGGTACTCA CCGAAGTACA GGGTGAATTC GGGCTGACGA CGGTGCAGGC GGCAAGTCTG ATCTCCGCAG CCTTTATCTC TCGCTGGTTC GGCGGCCTGA TGCTCGGCGC TATGGGTGAC CGCTACGGGC GTCGTCTGGC AATGGTCACC AGCATCGTTC TCTTCTCGGC CGGGACGCTG GCCTGCGGCT TTGCGCCAGG CTACATCACC ATGTTTATCG CTCGTCTGGT CATCGGCATG GGGATGGCGG GTGAATACGG TTCCAGCGCC ACCTATGTCA TTGAAAGCTG GCCAAAGCAT CTGCGTAACA AAGCCAGTGG TTTTCTGATT TCCGGCTTCT CAGTGGGGGC CGTCGTTGCC GCTCAGGTCT ATAGCCTGGT GGTTCCGGTC TGGGGCTGGC GTGCGCTGTT CTTTATCGGC ATTTTGCCAA TCATCTTTGC TCTCTGGCTG CGTAAAAACA TCCCGGAAGC GGAAGACTGG AAAGAGAAAC ACGGGGGTAA AGCACCGGTA CGCACGATGG TAGACATTCT TTACCGTGGT GAACATCGCA TTGCCAATAT CGTAATGACA CTGGCGGCGG CTACTGCGCT GTGGTTCTGC TTCGCCGGTA ACCTGCAAAA TGCCGCGATC GTCGCTGTTC TTGGGCTGTT ATGCGCCGCA ATCTTTATCA GCTTTATGGT GCAGAGTACG GGCAAACGCT GGCCAACGGG CGTAATGCTG ATGGTGGTCG TGTTGTTTGC TTTCCTCTAC TCATGGCCGA TTCAGGCGCT GCTGCCAACG TATCTGAAAA CCGATCTGGC TTATGACCCG CATACTGTCG CCAATGTGCT GTTCTTTAGT GGCTTTGGCG CGGCGGTGGG GTGCTGCGTC GGTGGCTTCC TCGGTGACTG GCTGGGAACC CGCAAAGCGT ACGTTTGTAG TCTGCTGGCC TCGCAGCTGC TGATTATTCC GGTATTTGCG ATTGGCGGCG CAAACGTCTG GGTGCTCGGT CTGTTACTGT TCTTCCAGCA AATGCTTGGA CAAGGGATCG CCGGGATCTT ACCGAAACTG ATTGGCGGTT ATTTCGATAC CGACCAGCGT GCAGCGGGCC TGGGCTTTAC CTACAACGTT GGCGCATTGG GCGGCGCACT GGCCCCAATC ATCGGCGCGT TGATCGCTCA ACGTCTGGAT CTGGGTACTG CGCTGGCATC GCTCTCGTTC AGTTTGACGT TTGTGGTGAT CCTGCTGATT GGTCTGGATA TGCCTTCTCG CGTTCAGCGT TGGCTGCGCC CGGAAGCATT GCGTACTCAT GACGCTATCG ACGGCAAACC ATTTAGCGGT GCCGTGCCAT TTGGCAGCGC CAAAAACGAT TTAGTCAAAA CCAAAAGTTA A
|
Protein sequence | MSTTTQNIPW YRHLNRAQWR AFSAAWLGYL LDGFDFVLIA LVLTEVQGEF GLTTVQAASL ISAAFISRWF GGLMLGAMGD RYGRRLAMVT SIVLFSAGTL ACGFAPGYIT MFIARLVIGM GMAGEYGSSA TYVIESWPKH LRNKASGFLI SGFSVGAVVA AQVYSLVVPV WGWRALFFIG ILPIIFALWL RKNIPEAEDW KEKHGGKAPV RTMVDILYRG EHRIANIVMT LAAATALWFC FAGNLQNAAI VAVLGLLCAA IFISFMVQST GKRWPTGVML MVVVLFAFLY SWPIQALLPT YLKTDLAYDP HTVANVLFFS GFGAAVGCCV GGFLGDWLGT RKAYVCSLLA SQLLIIPVFA IGGANVWVLG LLLFFQQMLG QGIAGILPKL IGGYFDTDQR AAGLGFTYNV GALGGALAPI IGALIAQRLD LGTALASLSF SLTFVVILLI GLDMPSRVQR WLRPEALRTH DAIDGKPFSG AVPFGSAKND LVKTKS
|
| |