Gene Information |
Locus tag | ECH74115_0709 |
Symbol | dcuC |
ID | 6969642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 739213 |
End bp | 740598 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643384744 |
Product | C4-dicarboxylate transporter DcuC |
Protein accession | YP_002269257 |
Protein GI | 209396097 |
COG category | [C] Energy production and conversion |
COG ID | [COG3069] C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00771] c4-dicarboxylate anaerobic carrier family protein |
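
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the reported 1386 bp) and that the gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(739213, 740598)
print(length)                     # 1386
print(protein_length_aa(length))  # 461
```

Both values agree with the Gene Length (1386 bp) and Protein Length (461 aa) fields of this record.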
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00726308 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACAT TCATTGAACT CCTTATTGGG GTTGTGGTTA TTGTGGGTGT AGCTCGCTAC ATCATTAAAG GGTATTCCGC CACTGGTGTG TTATTTGTCG GTGGCCTGTT ATTGCTGATT ATCAGTGCCA TTATGGGGCA CAAAGTGTTA CCGTCCAGCC AGGCTTCAAC AGGCTACAGC GCCACGGATA TCATTGAATA CGTTAAAATA TTACTAATGA GCCGCGGCGG CGACCTCGGC ATGATGATTA TGATGCTGTG TGGATTTGCC GCTTACATGA CCCATATCGG CGCGAATGAT ATGGTGGTCA AGCTGGCGTC AAAACCATTG CAGTATATTA ACTCCCCTTA CCTGCTGATG ATTGCCGCCT ATTTTGTCGC CTGTCTAATG TCTCTGGCCG TCTCTTCCGC AACCGGTCTG GGTGTTTTGC TGATGGCAAC CCTATTTCCG GTGATGGTAA ACGTTGGTAT CAGTCGTGGC GCAGCTGCTG CCATTTGTGC CTCCCCGGCG GCGATTATTC TCGCACCGAC TTCAGGGGAT GTGGTGCTGG CGGCGCAAGC TTCCGAAATG TCGCTGATTG ACTTCGCCTT CAAAACGACG CTGCCTATCT CAATTGCTGC AATTATCGGC ATGGCGATCG CCCACTTCTT CTGGCAACGT TATCTGGATA AAAAAGAGCA CATCTCTCAT GAAATGTTAG ATGTCAGTGA AATCACCACC ACTGCCCCTG CGTTTTATGC CATTTTGCCG TTCACGCCGA TCATCGGTGT ACTGATTTTT GACGGTAAAT GGGGTCCGCA ATTACACATC ATCACTATTC TGGTGATTTG TATGCTGATT GCCTCCATTC TGGAGTTCCT CCGCAGCTTT AATACCCAGA AAGTTTTCTC TGGTCTGGAA GTGGCTTATC GCGGGATGGC AGATGCGTTT GCTAACGTGG TGATGCTGCT GGTTGCCGCT GGGGTATTCG CTCAGGGGCT TAGCACCATC GGCTTTATTC AAAGTCTGAT TTCTATCGCT ACCTCGTTTG GTTCGGCGAG TATCATCCTG ATGCTGGTAT TGGTGATTCT GACAATGCTG GCGGCAGTCA CGACCGGTTC AGGCAATGCG CCGTTTTATG CGTTTGTTGA GATGATCCCG AAACTGGCGC ACTCTTCCGG CATTAACCCG GCGTATTTGA CTATCCCGAT GCTGCAGGCG TCAAACCTGG GCCGTACCCT TTCGCCCGTT TCTGGCGTAG TCGTTGCGGT TGCCGGGATG GCGAAGATCT CGCCGTTTGA AGTCGTAAAA CGCACCTCGG TACCGGTGCT TGTTGGTTTG GTGATTGTTA TCGTTGCTAC AGAGCTGATG GTGCCAGGAA CGGCAGCAGC GGTCACAGGC AAGTAA
|
Protein sequence | MLTFIELLIG VVVIVGVARY IIKGYSATGV LFVGGLLLLI ISAIMGHKVL PSSQASTGYS ATDIIEYVKI LLMSRGGDLG MMIMMLCGFA AYMTHIGAND MVVKLASKPL QYINSPYLLM IAAYFVACLM SLAVSSATGL GVLLMATLFP VMVNVGISRG AAAAICASPA AIILAPTSGD VVLAAQASEM SLIDFAFKTT LPISIAAIIG MAIAHFFWQR YLDKKEHISH EMLDVSEITT TAPAFYAILP FTPIIGVLIF DGKWGPQLHI ITILVICMLI ASILEFLRSF NTQKVFSGLE VAYRGMADAF ANVVMLLVAA GVFAQGLSTI GFIQSLISIA TSFGSASIIL MLVLVILTML AAVTTGSGNA PFYAFVEMIP KLAHSSGINP AYLTIPMLQA SNLGRTLSPV SGVVVAVAGM AKISPFEVVK RTSVPVLVGL VIVIVATELM VPGTAAAVTG K
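
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes that the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical, and only short excerpts of the record are used here.

```python
from itertools import product

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGCTGACAT TCATTGAACT CCTTATTGGG"))  # MLTFIELLIG
# Last 9 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GGCAAGTAA"))                          # GK
```

The two excerpts reproduce the first ten residues (MLTFIELLIG) and the final two residues (GK) of the listed protein sequence. Passing the full gene sequence to the same function would be the way to check the complete 461-residue translation.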