Gene Information |
Locus tag | EcSMS35_4607 |
Symbol | dcuA |
ID | 6147437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4710439 |
End bp | 4711740 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619423 |
Product | anaerobic C4-dicarboxylate transporter |
Protein accession | YP_001746534 |
Protein GI | 170681409 |
COG category | [R] General function prediction only |
COG ID | [COG2704] Anaerobic C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein |
|
|
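The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the stated 1302 bp), and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(4710439, 4711740)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Because the gene is on the minus strand, Start bp and End bp give the span on the replicon, not the direction of transcription; the length calculation is the same on either strand.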
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.104112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.747742 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAGTTG TAGAACTCAT CATAGTTTTG CTGGCGATCT TCTTGGGCGC CAGATTGGGG GGAATAGGTA TTGGTTTTGC AGGCGGATTG GGGGTGCTGG TTCTTGCCGC TATTGGCGTT AAACCCGGTA ACATCCCGTT CGATGTCATT TCCATTATCA TGGCGGTTAT CGCCGCTATT TCTGCCATGC AGGTTGCTGG CGGTCTGGAC TATCTGGTTC ATCAGACAGA AAAGCTGCTG CGCCGTAACC CGAAATACAT CACGATCCTC GCACCGATCG TGACCTATTT CCTGACTATC TTTGCTGGTA CTGGCAACAT CTCTCTGGCG ACACTGCCAG TTATCGCTGA AGTTGCGAAG GAACAAGGCG TTAAACCTTG CCGTCCGCTG TCTACTGCAG TGGTATCCGC GCAGATTGCG ATCACCGCAT CGCCAATCTC AGCAGCAGTG GTTTACATGT CTTCCGTGAT GGAAGGTCAT GGCATCAGCT ACCTCCATCT GCTCTCCGTG GTCATCCCGT CCACCCTGCT GGCGGTTCTG GTGATGTCCT TCCTGGTCAC TATGCTGTTC AACTCCAAAC TCTCTGACGA TCCGATTTAT CGCAAGCGTC TGGAAGAGGG CCTGGTTGAA CTGCGCGGTG AAAAGCAGAT TGAAATCAAA TCCGGTGCAA AAACGTCCGT CTGGCTGTTC CTGCTGGGCG TAGTTGGCGT GGTTATCTAT GCAATCATCA ACAGCCCAAG CATGGGTCTG GTTGAAAAAC CGCTGATGAA CACCACCAAC GCAATCCTGA TCATCATGCT CAGCGTTGCA ACTCTGACCA CCGTTATCTG TAAAGTCGAT ACCGACAACA TCCTCAACTC CAGCACCTTC AAAGCAGGTA TGAGCGCCTG TATTTGTATC CTGGGTGTTG CGTGGCTGGG CGATACTTTC GTTTCCAACA ACATCGACTG GATCAAAGAT ACCGCTGGTG AAGTGATTCA GGGTCATCCG TGGCTGCTGG CCGTCATCTT CTTCTTTGCT TCTGCTCTGC TGTACTCTCA GGCTGCAACC GCAAAAGCAC TGATGCCGAT GGCTCTGGCA CTGAACGTTT CTCCGCTGAC CGCTGTTGCT TCTTTCGCTG CGGTGTCTGG TCTGTTCATT CTGCCGACCT ACCCGACGCT GGTTGCTGCG GTACAGATGG ATGACACGGG TACTACCCGT ATCGGTAAAT TCGTCTTCAA CCATCCGTTC TTCATCCCGG GTACTCTGGG TGTTGCCCTG GCCGTTTGCT TCGGCTTCGT GCTGGGTAGC TTCATGCTGT AA
|
Protein sequence | MLVVELIIVL LAIFLGARLG GIGIGFAGGL GVLVLAAIGV KPGNIPFDVI SIIMAVIAAI SAMQVAGGLD YLVHQTEKLL RRNPKYITIL APIVTYFLTI FAGTGNISLA TLPVIAEVAK EQGVKPCRPL STAVVSAQIA ITASPISAAV VYMSSVMEGH GISYLHLLSV VIPSTLLAVL VMSFLVTMLF NSKLSDDPIY RKRLEEGLVE LRGEKQIEIK SGAKTSVWLF LLGVVGVVIY AIINSPSMGL VEKPLMNTTN AILIIMLSVA TLTTVICKVD TDNILNSSTF KAGMSACICI LGVAWLGDTF VSNNIDWIKD TAGEVIQGHP WLLAVIFFFA SALLYSQAAT AKALMPMALA LNVSPLTAVA SFAAVSGLFI LPTYPTLVAA VQMDDTGTTR IGKFVFNHPF FIPGTLGVAL AVCFGFVLGS FML
|
| |
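The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any calculation. The following is a minimal sketch of how one might strip the spacing, compute GC content, and run a basic sanity check on the CDS. All function names are hypothetical. The start-codon check accepts only ATG, which is a simplification: translation table 11 also allows alternative start codons, although this record does begin with ATG.

```python
STOP_CODONS = ("TAA", "TAG", "TGA")  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used only as a short demo input.
fragment = clean_sequence("ATGCTAGTTG TAGAACTCAT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 35.0
```

Applied to the full Gene sequence field, `gc_percent` can be compared with the GC content listed in the record, and `looks_like_complete_cds` should hold for a sequence that starts with ATG and ends with TAA, as this one does.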