Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0826 |
Symbol | |
ID | 6972161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 848762 |
End bp | 850042 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643384851 |
Product | transporter, dicarboxylate/amino acid:cation family |
Protein accession | YP_002269357 |
Protein GI | 209398852 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA TAAGTTTAAC CACGATGATT CTTTTGGCGC TGGTACTTGG AATGATTATC GGCGTAGTGC TCAATAACAC TGCTTCACCG GAAACCGCAA AACTCTATGC GCAAGAAATA TCGATATTCA CGACGATTTT CTTACGACTG ATAAAAATGA TTATCGCTCC GTTAGTGGTC TCTACCCTGG TGGTAGGTAT TGCTAAAATG GGCGATGCCA AAGCCCTTGG TCGTATTTTT TCTAAAACAC TCTTTTTATT TATTTGCGCC TCATTGCTGT CAATTGCCTT AGGCTTGATA ACGGTAAATT TCTTCATGCC AGGTACAGGA ATTAATTTTG TTGCACACGG TGCCGAAACC ACCGGAGTGG TCGCGGCAGA ACCCTTTACG CTAAAAGTAT TTATTTCGCA TGCTTTCCCC ACCAGCATTG TCGATGCCAT GGCGCACAAT GAAATTTTGC AAATCGTGGT GTTCTCAATT TTCCTCGGCT GTAGCCTGAC GGCGATTGGC GAGAAAGGCA GCGCCATCGT TCACGCCTTA GATTCGCTGG CACATGCCAT GTTAAAGCTC ACTGGCTACG TCATGCTCTT CGCTCCCCTG ACCGTATTCG CTGCTATTTC AGCATTGATT GCTGAACGAG GACTGGCAGT TATGGTGAGC GCCGGGATCT TTATGGGTGA ATTTTATTTC ACCATGTTAT TACTTTGGGT ACTGCTTATC GGTCTGGCCA TCGTTTATGT CGGCCCCTGC ATCAGACGCC TGACCCGCGC CCTTTCGGAA CCCGCCCTGC TGGCATTTAC CACATCCAGT TCTGAAGCGG CTTTTCCGGG AACGCTTGAA AAACTGGAGC AATTTGGCGT TTCCCCCAAA ATTGCCAGCT TTGTCTTACC CATTGGCTAC TCATTTAATC TCGTTGGATC AATGGCCTAC TGCTCCTTCG CCACAGTTTT CATCGCCCAG GCCTGCAATA TCCATTTATC CATCGGTGAG CAAATCACCA TGCTGTTGAT CCTGATGTTG ACCTCGAAAG GAATGGCTGG CGTACCACGC GCCTCAATGG TGGTTATCGC CGCCACGCTC AACCAGTTCA ATATTCCGGA AGCGGGGCTG ATCTTGCTGA TGGGCGTTGA TCCGTTCCTT GATATGGGGC GTTCCGCGAC AAACGTCATG AGCAACGCAA TGGGCGCTGC GATGGTGAGT CGGTGGGAAG GCGAACATTT CGGCGAGGGC TGTCGGGGTA AAGCATTAAA ACCCAATGAA TCGAACGTTG CTCTGCCCTG A
|
Protein sequence | MKKISLTTMI LLALVLGMII GVVLNNTASP ETAKLYAQEI SIFTTIFLRL IKMIIAPLVV STLVVGIAKM GDAKALGRIF SKTLFLFICA SLLSIALGLI TVNFFMPGTG INFVAHGAET TGVVAAEPFT LKVFISHAFP TSIVDAMAHN EILQIVVFSI FLGCSLTAIG EKGSAIVHAL DSLAHAMLKL TGYVMLFAPL TVFAAISALI AERGLAVMVS AGIFMGEFYF TMLLLWVLLI GLAIVYVGPC IRRLTRALSE PALLAFTTSS SEAAFPGTLE KLEQFGVSPK IASFVLPIGY SFNLVGSMAY CSFATVFIAQ ACNIHLSIGE QITMLLILML TSKGMAGVPR ASMVVIAATL NQFNIPEAGL ILLMGVDPFL DMGRSATNVM SNAMGAAMVS RWEGEHFGEG CRGKALKPNE SNVALP
|
| |