Gene Information |
Locus tag | EcHS_A3243 |
Symbol | |
ID | 5592216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3251470 |
End bp | 3252933 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640922361 |
Product | anion transporter |
Protein accession | YP_001459857 |
Protein GI | 157162539 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
|
|
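The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3251470
end_bp = 3252933

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not appear in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1464
print(protein_length)   # 487
```

Both results agree with the Gene Length (1464 bp) and Protein Length (487 aa) fields.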
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00000262372 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCTT CCACTGAATG GTGGCGATAC CTTGCGCCGC TGGCGGTCAT CGCCATTATT GCTCTAATTC CGGTTCCCGC AGGGCTGGAG AGTCATACCT GGCTTTACTT TGCCGTTTTT ACTGGCGTGA TCGTTGGACT GATCCTCGAA CCCGTGCCGG GTGCCGTGGT GGCGATGGTG GGTATATCCA TCATCGCCAT ACTCTCTCCC TGGCTGCTGT TCAGCCCGGA GCAGCTCGCT CAGCCAGGCT TTAAATTCAC TGCAAAATCC CTCTCGTGGG CCGTTTCCGG TTTTTCTAAT TCGGTTATCT GGCTGATTTT CGCCGCCTTT ATGTTTGGCA CAGGCTATGA AAAAACCGGG CTTGGACGCC GCATTGCGCT GATTCTGGTG AAAAAGATGG GGCATCGTAC GCTGTTTCTC GGCTATGCGG TGATGTTCTC CGAGCTGATC CTGGCACCTG TAACACCGTC CAACTCGGCG CGTGGTGCGG GGATTATCTA TCCCATCATC CGTAACCTGC CACCGCTCTA TCAATCACAA CCAAACGACA GCAGTTCACG CAGCATTGGC TCGTACATCA TGTGGATGGG GATTGTTGCC GACTGCGTGA CCAGCGCCAT TTTCCTGACG GCGATGGCAC CAAACTTGCT GTTAATTGGA CTGATGAAAA GTGCATCTCA CGCCACGCTG AGTTGGGGCG ACTGGTTCCT CGGGATGTTG CCGCTCAGCA TTTTACTGGT TCTGCTGGTT CCCTGGATGG CTTACGTGCT GTACCCACCG GTACTGAAGT CTGGTGATCA GGTGCCGCGC TGGGCAGAGA CGGAACTGCA GGCAATGGGC CCGCTCTGTT CTCGTGAAAA ACGGATGCTG GGGCTGATGG TAGGCGCGCT GGTGCTGTGG ATTTTCGGCG GTGATTATAT TGATGCTGCG ATGGTTGGTT ACAGCGTAGT GGCACTGATG CTGCTTCTGC GCATTATCTG CTGGGACGAC ATTGTCAGTA ATAAAGCGGC GTGGAACGTT TTCTTCTGGC TGGCCTCGCT TATCACCCTC GCTACCGGAC TCAACAACAC CGGTTTTATT AGCTGGTTTG GCAAACTGTT AGCAGGCAGC TTAAGTGGTT ATTCGCCGAC GATGGTGATG GTGGCATTGA TTGTGGTGTT TTATCTACTG CGCTACTTTT TCGCCAGCGC CACGGCGTAT ACCTCCGCCC TCGCACCGAT GATGATCGCC GCCGCGCTGG CGATGCCGGA AATCCCGCTG CCGGTGTTCT GCCTGATGGT TGGCGCGGCA ATTGGTCTGG GGAGCATTCT TACACCATAC GCCACCGGAC CCAGTCCGAT TTACTACGGT AGTGGTTATC TGCCAACGGC GGATTACTGG CGACTGGGGG CGATTTTTGG GCTGATATTC CTCGTATTGC TGGTGATTAC CGGCTTACTG TGGATGCCCG TGGTGTTGCT TTAA
|
Protein sequence | MKPSTEWWRY LAPLAVIAII ALIPVPAGLE SHTWLYFAVF TGVIVGLILE PVPGAVVAMV GISIIAILSP WLLFSPEQLA QPGFKFTAKS LSWAVSGFSN SVIWLIFAAF MFGTGYEKTG LGRRIALILV KKMGHRTLFL GYAVMFSELI LAPVTPSNSA RGAGIIYPII RNLPPLYQSQ PNDSSSRSIG SYIMWMGIVA DCVTSAIFLT AMAPNLLLIG LMKSASHATL SWGDWFLGML PLSILLVLLV PWMAYVLYPP VLKSGDQVPR WAETELQAMG PLCSREKRML GLMVGALVLW IFGGDYIDAA MVGYSVVALM LLLRIICWDD IVSNKAAWNV FFWLASLITL ATGLNNTGFI SWFGKLLAGS LSGYSPTMVM VALIVVFYLL RYFFASATAY TSALAPMMIA AALAMPEIPL PVFCLMVGAA IGLGSILTPY ATGPSPIYYG SGYLPTADYW RLGAIFGLIF LVLLVITGLL WMPVVLL
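The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here).

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in tens and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAAACCTT CCACTGAATG GTGGCGATAC"
print(translate(prefix))  # MKPSTEWWRY
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.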