Gene Information |
Locus tag | ECH74115_4541 |
Symbol | nanT1 |
ID | 6967654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4208098 |
End bp | 4209588 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388253 |
Product | putative sialic acid transporter |
Protein accession | YP_002272688 |
Protein GI | 209397614 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00891] putative sialic acid transporter |
|
|
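The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 4208098
END_BP = 4209588


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1491 496
```

Under these assumptions the computed values agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.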
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.315393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACTA CAACCCAGAA TATCCCGTGG TATCGCCATC TCAATCGTGC ACAATGGCGG GCATTTTCCG CTGCCTGGTT GGGATATCTG CTTGACGGTT TTGATTTCGT TTTAATCGCC CTGGTACTCA CCGAAGTACA AGGTGAATTC GGGCTGACGA CGGTGCAGGC GGCAAGTCTG ATCTCCGCAG CCTTTATCTC TCGCTGGTTC GGCGGCCTGA TGCTCGGCGC TATGGGTGAC CGCTACGGGC GTCGTCTGGC AATGGTCACC AGCATCGTTC TCTTCTCGGC CGGGACGCTG GCATGCGGCT TTGCGCCAGG CTACATCACC ATGTTTATCG CTCGTCTGGT CATCGGCATG GGGATGGCGG GTGAATACGG TTCCAGCGCC ACCTATGTCA TTGAAAGCTG GCCAAAACAT CTGCGTAACA AAGCCAGTGG CTTTCTGATT TCAGGCTTCT CTGTAGGGGC CGTCGTTGCC GCTCAGGTCT ACAGCCTGGT GGTTCCGGTC TGGGGCTGGC GTGCGCTGTT CTTTATCGGC ATTTTGCCAA TCATCTTTGC TCTCTGGCTG CGTAAAAACA TCCCGGAAGC AGAAGACTGG AAAGAGAAAC ACGCAGGTAA AGCACCAGTA CGCACAATGG TGGATATTCT CTACCGTGGT GAACATCGCA TTGCCAATAT CGTAATGACA CTGGCGGCGG CTACTGCGCT GTGGTTCTGC TTCGCCGGTA ACCTGCAAAA TGCCGCGATC GTCGCTGTTC TTGGGCTGTT ATGCACCGCA ATCTTTATCA GCTTTATGGT GCAGAGTACG GGCAAACGCT GGCCAACTGG CGTAATGCTG ATGGTGGTCG TGTTGTTTGC TTTCCTCTAC TCATGGCCGA TTCAGGCGCT GCTGCCAACG TATCTGAAAA CCGATCTGGC TTATAACCCT CATACTGTCG CCAATGTGCT GTTCTTTAGT GGCTTTGGCG CGGCGGTGGG GTGCTGCGTC GGTGGCTTCC TCGGTGACTG GCTGGGAACC CGCAAAGCGT ACGTTTGTAG TCTGTTGGCC TCGCAGCTGC TGATTATTCC GGTATTTGCT ATTGGCGGTG CAAACGTCTG GGTGCTCGGT CTGTTACTGT TCTTCCAGCA AATGCTTGGA CAAGGGATCG CCGGGATCTT ACCGAAACTG ATTGGCGGTT ATTTCGATAC CGACCAGCGT GCAGCGGGCC TGGGCTTTAC CTATAACGTT GGCGCATTGG GTGGCGCACT GGCCCCAATC ATCGGCGCGT TGATCGCTCA ACGTCTGGAT CTGGGTACTG CACTGGCATC GCTCTCGTTC AGTCTGACAT TTGTGGTGAT CCTGCTGATT GGTCTGGATA TGCCTTCTCG TGTTCAGCGT TGGCTGCGCC CGGAAGCGTT GCGTACTCAT GACGCTATCG ACGGCAAACC ATTTAGCGGT GCCGTGCCGT TTGGCAGCGC CAAAAACGAT TTAGTCAAAA CCAAAAGTTA A
|
Protein sequence | MSTTTQNIPW YRHLNRAQWR AFSAAWLGYL LDGFDFVLIA LVLTEVQGEF GLTTVQAASL ISAAFISRWF GGLMLGAMGD RYGRRLAMVT SIVLFSAGTL ACGFAPGYIT MFIARLVIGM GMAGEYGSSA TYVIESWPKH LRNKASGFLI SGFSVGAVVA AQVYSLVVPV WGWRALFFIG ILPIIFALWL RKNIPEAEDW KEKHAGKAPV RTMVDILYRG EHRIANIVMT LAAATALWFC FAGNLQNAAI VAVLGLLCTA IFISFMVQST GKRWPTGVML MVVVLFAFLY SWPIQALLPT YLKTDLAYNP HTVANVLFFS GFGAAVGCCV GGFLGDWLGT RKAYVCSLLA SQLLIIPVFA IGGANVWVLG LLLFFQQMLG QGIAGILPKL IGGYFDTDQR AAGLGFTYNV GALGGALAPI IGALIAQRLD LGTALASLSF SLTFVVILLI GLDMPSRVQR WLRPEALRTH DAIDGKPFSG AVPFGSAKND LVKTKS
|
| |
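As a rough consistency check between the two sequences above, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand) and uses the standard amino acid assignments, which translation table 11 shares. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence from the record above.
    print(translate("ATGAGTACTA CAACCCAGAA TATCCCGTGG"))  # prints: MSTTTQNIPW
```

Passing the full gene sequence to `translate` in the same way would allow the whole result to be compared against the protein sequence field.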