Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1122 |
Symbol | |
ID | 6144736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1140962 |
End bp | 1142278 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641616002 |
Product | anaerobic C4-dicarboxylate transporter |
Protein accession | YP_001743194 |
Protein GI | 170681956 |
COG category | [R] General function prediction only |
COG ID | [COG2704] Anaerobic C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.253067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCTC TATTTTTTAT ACAACTATTT ATTGTTCTGG CCTGTATTGG AATCGGCGGA AAATACGGCG GAATGGGGCT TGGTGCTGGA GGTGGGTTAG GGGTTGCAAT ATTGGTTTTG GGCTTTGGAC TGAAACCCTC CTCTCCCTCC ATTACTGCTA TGTTAATTAT TATTTGTGTT ATTAGTGCTG TCAGTATATT ACAAGCTGTT GGAGGATTAG ATTATTTAGT CAGGGTCGCA GAACGGATAC TCCGTAAGAA ACCACAGGCG ATAACCTTTG TAGCGCCAAT ATTAACTTCA GTATTTACAC TATTTTGTGG GACGACTTAT GTCGCTTTCT CCTTATACCC TGTTATTGCT GAAGTCGCCG CTGAAGCAAA AGTACGTCCA GAACGTGCAT TATCTGCAAC GGTCATTGCA GCCAGTGTGG CAGTGGCTGC CAGCCCTATG AGTGCAGCAA CTGCCGGGAT GCTGGCTATA CTTCATGAGT ATGCAGGGAT CACGTTAGGG CAAATATTAT CTATTGCCCT TCCTTCGTTT TTTATGGCTG CAATTGTTAC CAGCTTTTCA GTCTACAAGC GTGGTAAAGA ACTGGAAGAT GATCCTGAAT TTCAAAGGCG TGTTGCAGCA GGTGAGTATG AGTTCATGCA CACCGAACAG AAAAAAGAAT ATGTCGCTGC ACCAGGTGCA AGAAAAGGGG TTGCTATTTT TGCCATAGGA GTCGTGCTGG TCCTTATTCT CGGATCTTTT ACTGAGCTTC TTCCGTCGTG GGATGGAAAA AGATTATCAA CCCCAATGGT CATCCAGATG ATTATGTTGA CTGCAGCTTT ATTGATCATG ATTGTGGGGA AAGTTCCAAG CAATAAATTT AACAGTGGCT CTGTGTTCCG TGCTGGTTTG ATGGGGGTTG TGGCAATACT TGGTGTTTCG TGGATGACTG CAACATTTTT TGATGCTTAT CAGGCAGAGC TGATCAATGT TTTTGGCAAT CTTGTAAATG ATGCGCCTTT GCTGTTTGGC TTTATCGTAT TCTTGTTTTC TCTGGTGATC ATGAGCCCGG CTGCGACAGT AGCTGCGATT ATGCCATTAG GAGTCACCCT GGGTATTCCT GCACCTTATC TTATTGCAAT TTTTGCCTGT ACTTGTGGTG ATTTTATTAT ACCCGGAGCG AATCAGATTG GTTGTGTGGC GTTTGACAGA ACTGGTACTA CCAGAATTGG GCGCTTTGTT ATCAACCATA GCTATATACG TCCTGGGTTC GTCATGGTCA TTTCCCAGGT TGTTTTTGCC TACTTAATTG CGCAGGTTAT TCTGTAA
|
Protein sequence | MDALFFIQLF IVLACIGIGG KYGGMGLGAG GGLGVAILVL GFGLKPSSPS ITAMLIIICV ISAVSILQAV GGLDYLVRVA ERILRKKPQA ITFVAPILTS VFTLFCGTTY VAFSLYPVIA EVAAEAKVRP ERALSATVIA ASVAVAASPM SAATAGMLAI LHEYAGITLG QILSIALPSF FMAAIVTSFS VYKRGKELED DPEFQRRVAA GEYEFMHTEQ KKEYVAAPGA RKGVAIFAIG VVLVLILGSF TELLPSWDGK RLSTPMVIQM IMLTAALLIM IVGKVPSNKF NSGSVFRAGL MGVVAILGVS WMTATFFDAY QAELINVFGN LVNDAPLLFG FIVFLFSLVI MSPAATVAAI MPLGVTLGIP APYLIAIFAC TCGDFIIPGA NQIGCVAFDR TGTTRIGRFV INHSYIRPGF VMVISQVVFA YLIAQVIL
|
| |