Gene Information |
Locus tag | ECH74115_4375 |
Symbol | |
ID | 6971238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4050638 |
End bp | 4052101 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388098 |
Product | anion transporter |
Protein accession | YP_002272536 |
Protein GI | 209400804 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
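The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
START_BP = 4050638
END_BP = 4052101

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1464
print(protein_length(length_bp))  # 487
```

Both results match the Gene Length (1464 bp) and Protein Length (487 aa) fields.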
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00267264 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTT CCACTGAATG GTGGCGATAC CTTGCGCCGC TGGCGGTCAT CGCCATTATT GCTCTAATTC CGGTTCCCGC AGGGCTGGAG AGTCATACCT GGCTCTACTT TGCCGTTTTT ACTGGCGTAA TCGTTGGACT GATCCTCGAA CCCGTGCCGG GTGCCGTGGT GGCGATGGTG GGTATCTCCA TCATCGCCAT ACTCTCTCCC TGGCTGCTGT TCAGCCCGGA GCAGCTCGCT CAGCCAGGCT TTAAATTCAC TGCAAAATCC CTCTCGTGGG CCGTTTCCGG TTTTTCTAAT TCGGTTATCT GGCTGATTTT CGCCGCCTTT ATGTTTGGCA CAGGCTATGA AAAAACCGGG CTGGGGCGAC GTATCGCGCT GATTCTGGTG AAAAAGATGG GGCATCGCAC GCTGTTTCTT GGCTATGCGG TGATGTTCTC CGAGCTGATC CTGGCACCTG TCACACCGTC CAACTCGGCG CGTGGTGCGG GGATTATCTA TCCCATCATC CGTAACCTGC CGCCGCTCTA TCAATCACAA CCAAACGACA GCAGTTCGCG CAGTATTGGC TCGTACATCA TGTGGATGGG GATTGTTGCC GACTGCGTGA CCAGCGCCAT TTTCCTGACG GCGATGGCTC CTAACTTGCT GTTAATTGGA CTGATGAAAA GCGCATCTCA CGCCACGCTG AGTTGGGGCG ACTGGTTCCT CGGGATGTTG CCGCTCAGTA TTTTACTGGT CCTGCTGGTT CCCTGGCTGG CTTACGTGCT GTACCCGCCT GTACTGAAGT CAGGCGATCA GGTGCCGCGC TGGGCAGAGA CGGAACTGCA GGCAATGGGC CCGCTCTGTT CGCGTGAAAA ACGGATGCTG GGGCTGATGG TAGGCGCGCT GGTGCTGTGG ATTTTCGGCG GTGATTATAT CGATGCCGCG ATGGTCGGTT ACAGCGTGGT GGCGCTGATG CTGCTTCTGC GCATTATCAG TTGGGACGAT ATTGTCAGTA ATAAAGCGGC GTGGAACGTT TTCTTCTGGC TGGCCTCGCT TATCACCCTC GCGACCGGAC TCAACAACAC CGGTTTTATT AGCTGGTTTG GCAAACTGTT AGCAGGCAGC TTAAGCGGTT ATTCGCCAAC GATGGTGATG GTGGCGTTGA TTGTGGTGTT TTATCTACTG CGCTACTTTT TTGCCAGCGC CACGGCGTAT ACCTCCGCTC TCGCACCGAT GATGATTGCC GCTGCGCTGG CGATGCCGGA AATCCCGCTG CCGGTGTTCT GCCTGATGGT TGGTGCGGCA ATTGGTCTGG GGAGCATTCT TACACCATAC GCCACCGGCC CCAGTCCGAT TTACTACGGC AGTGGTTATC TGCCAACGGC GGATTACTGG CGACTGGGGG CGATTTTTGG GCTGATATTC CTCGTATTGC TGGTGATTAC CGGCTTACTG TGGATGCCCG TGGTGTTGCT TTAA
|
Protein sequence | MKPSTEWWRY LAPLAVIAII ALIPVPAGLE SHTWLYFAVF TGVIVGLILE PVPGAVVAMV GISIIAILSP WLLFSPEQLA QPGFKFTAKS LSWAVSGFSN SVIWLIFAAF MFGTGYEKTG LGRRIALILV KKMGHRTLFL GYAVMFSELI LAPVTPSNSA RGAGIIYPII RNLPPLYQSQ PNDSSSRSIG SYIMWMGIVA DCVTSAIFLT AMAPNLLLIG LMKSASHATL SWGDWFLGML PLSILLVLLV PWLAYVLYPP VLKSGDQVPR WAETELQAMG PLCSREKRML GLMVGALVLW IFGGDYIDAA MVGYSVVALM LLLRIISWDD IVSNKAAWNV FFWLASLITL ATGLNNTGFI SWFGKLLAGS LSGYSPTMVM VALIVVFYLL RYFFASATAY TSALAPMMIA AALAMPEIPL PVFCLMVGAA IGLGSILTPY ATGPSPIYYG SGYLPTADYW RLGAIFGLIF LVLLVITGLL WMPVVLL
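The relationship between the two sequence fields can be sketched in a few lines. The example below is a minimal illustration, not the pipeline used to build this record: it strips the spacing between the 10-base blocks, computes GC fraction, and translates codons. It relies on translation table 11 sharing its amino acid assignments with the standard code (the tables differ in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Amino acid assignments are
# those of the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_30 = "ATGAAACCTT CCACTGAATG GTGGCGATAC"
print(translate(first_30))  # MKPSTEWWRY
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would be the way to compare against the GC content and Protein sequence fields.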