Gene Information |
Locus tag | ECH74115_4892 |
Symbol | dctA |
ID | 6971169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4527376 |
End bp | 4528662 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388580 |
Product | C4-dicarboxylate transporter DctA |
Protein accession | YP_002273008 |
Protein GI | 209398499 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
| |
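The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (the record does not state its convention) and that Protein Length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 4527376, End bp 4528662.
length = gene_length(4527376, 4528662)
print(length, protein_length(length))  # 1287 428
```

Under these assumptions the result agrees with the Gene Length (1287 bp) and Protein Length (428 aa) fields.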
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.321398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCT CTCTGTTTAA AAGCCTTTAC TTTCAGGTCC TGACAGCGAT AGCCATTGGT ATTCTCCTTG GCCATTTCTA TCCTGAAATA GGCGAGCAAA TGAAACCGCT TGGCGACGGC TTCGTTAAGC TCATTAAGAT GATCATCGCT CCTGTCATCT TTTGTACCGT CGTAACGGGC ATTGCGGGCA TGGAAAGCAT GAAGGCGGTC GGTCGTACCG GCGCAGTCGC ACTGCTTTAC TTTGAAATTG TCAGTACCAT CGCGCTGATT ATTGGTCTTA TCATCGTTAA CGTCGTGCAG CCTGGTGCCG GAATGAACGT CGATCCGGCA ACGCTTGATG CGAAAGCGGT AGCGGTTTAC GCCGATCAGG CGAAAGACCA GGGCATTGTC GCCTTCATTA TGGATGTCAT CCCGGCGAGC GTCATTGGCG CATTTGCCAG CGGTAACATT CTGCAGGTGC TGCTGTTTGC CGTACTGTTT GGTTTTGCGC TCCACCGTCT GGGCAGCAAA GGCCAACTGA TTTTTAACGT CATCGAAAGT TTCTCGCAGG TCATCTTCGG CATCATCAAT ATGATCATGC GTCTGGCACC TATTGGTGCG TTCGGGGCAA TGGCGTTTAC CATCGGTAAA TACGGCGTCG GCACACTGGT GCAACTGGGG CAGCTGATTA TCTGTTTCTA CATTACCTGT ATCCTGTTTG TGGTGCTGGT ATTGGGTTCA ATCGCTAAAG CGACTGGTTT CAGTATCTTC AAATTTATCC GCTACATCCG TGAAGAACTG CTGATTGTAC TGGGGACTTC ATCTTCCGAG TCGGCGCTGC CGCGTATGCT CGACAAGATG GAGAAACTCG GCTGCCGTAA ATCGGTGGTG GGGCTGGTCA TCCCGACAGG CTACTCGTTT AACCTTGATG GCACATCGAT ATACCTGACA ATGGCGGCGG TGTTTATCGC CCAGGCCACT AACAGTCAGA TGGATATCGT CCACCAAATC ACGCTGTTAA TCGTGTTGCT GCTTTCTTCT AAAGGGGCGG CAGGGGTAAC GGGTAGTGGC TTTATCGTGC TGGCGGCGAC GCTCTCTGCG GTGGGCCATT TGCCGGTAGC GGGTCTGGCG CTGATCCTCG GTATCGACCG CTTTATGTCA GAAGCTCGTG CGCTGACTAA CCTGGTCGGT AACGGCGTAG CGACCATTGT CGTTGCTAAG TGGGTGAAAG AACTGGACCA CAAAAAACTG GACGATGCGC TGAATAATCG TGCGCCGGAT GGCAAAACGC ACGAATTATC CTCTTAA
Protein sequence | MKTSLFKSLY FQVLTAIAIG ILLGHFYPEI GEQMKPLGDG FVKLIKMIIA PVIFCTVVTG IAGMESMKAV GRTGAVALLY FEIVSTIALI IGLIIVNVVQ PGAGMNVDPA TLDAKAVAVY ADQAKDQGIV AFIMDVIPAS VIGAFASGNI LQVLLFAVLF GFALHRLGSK GQLIFNVIES FSQVIFGIIN MIMRLAPIGA FGAMAFTIGK YGVGTLVQLG QLIICFYITC ILFVVLVLGS IAKATGFSIF KFIRYIREEL LIVLGTSSSE SALPRMLDKM EKLGCRKSVV GLVIPTGYSF NLDGTSIYLT MAAVFIAQAT NSQMDIVHQI TLLIVLLLSS KGAAGVTGSG FIVLAATLSA VGHLPVAGLA LILGIDRFMS EARALTNLVG NGVATIVVAK WVKELDHKKL DDALNNRAPD GKTHELSS
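The sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may initiate translation), and all helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and any other whitespace; upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown in the record.
print(translate("ATGAAAACCT CTCTGTTTAA AAGCCTTTAC"))  # MKTSLFKSLY
```

The printed residues match the first block of the protein sequence in the record. Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field.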