Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3023 |
Symbol | dcuC |
ID | 6066017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3301267 |
End bp | 3302652 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641602439 |
Product | C4-dicarboxylate transporter DcuC |
Protein accession | YP_001725974 |
Protein GI | 170021020 |
COG category | [C] Energy production and conversion |
COG ID | [COG3069] C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00771] c4-dicarboxylate anaerobic carrier family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.780031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACAT TCATTGAACT CCTTATTGGG GTTGTGGTTA TTGTGGGTGT AGCTCGCTAC ATCATTAAAG GGTATTCCGC CACTGGTGTG TTATTTGTCG GTGGCCTGTT ATTGCTGATT ATCAGTGCCA TTATGGGGCA CAAAGTGTTA CCGTCCAGCC AGGCTTCAAC AGGCTACAGC GCCACGGATA TCGTTGAGTA CGTTAAAATA TTGCTAATGA GCCGCGGCGG CGACCTCGGC ATGATGATTA TGATGCTGTG TGGCTTTGCT GCTTACATGA CCCATATCGG CGCGAATGAT ATGGTGGTCA AGCTGGCGTC AAAACCATTG CAGTATATTA ACTCCCCCTA CCTGCTGATG ATTGCCGCCT ATTGTGTTGC CTGTCTGATG TCACTGGCCG TCTCTTCCGC AACCGGTCTG GGTGTTTTGC TGATGGCAAC CCTGTTTCCG GTGATGGTAA ACGTTGGTAT CAGTCGTGGG GCAGCAGCTG CCATTTGTGC CTCCCCGGCG GCGATTATTC TCGCACCGAC TTCAGGGGAT GTGGTGCTGG CGGCGCAGGC TTCCGAAATG TCGCTGATTG ACTTCGCCTT CAAAACGACA CTGCCTATCT CAATTGCTGC AATTATCGGC ATGGCGATCG CCCACTTCTT CTGGCAACGT TATCTGGATA AAAAAGAGCA CATCTCTCAT GAAATGTTAG ATGTCAGTGA AATCACTACC ACTGCCCCTG CGTTTTATGC CATTTTGCCG TTCACGCCGA TCATCGGTGT ACTGATTTTT GACGGTAAAT GGGGTCCGCA ATTACACATC ATCACTATTC TGGTGATTTG TATGCTGATT GCCTCCATTC TGGAGTTCCT CCGCAGCTTT AATACCCAGA AAGTTTTCTC TGGTCTGGAA GTGGCTTATC GCGGGATGGC AGATGCGTTT GCTAACGTGG TGATGCTGCT GGTTGCCGCT GGGGTATTCG CTCAGGGGCT TAGCACCATC GGCTTTATTC AAAGTCTGAT TTCTATCGCT ACCTCGTTTG GTTCGGCGAG TATCATCCTG ATGCTGGTAT TGGTGATTCT GACCATGCTG GCGGCAGTCA CGACCGGTTC AGGCAATGCG CCGTTTTATG CGTTTGTTGA GATGATCCCG AAACTGGCGC ACTCTTCCGG CATTAACCCG GCGTATTTGA CTATCCCGAT GCTGCAGGCG TCAAACCTGG GTCGTACCCT ATCACCCGTT TCTGGCGTAG TCGTTGCGGT TGCCGGGATG GCGAAAATCT CACCATTTGA AGTCGTAAAA CGCACCTCGG TGCCGGTGCT TGTTGGGCTA GTGATTGTTA TCGTTGCTAC AGAGCTGATG GTGCCAGGAA CGGCAGCAGC GGTCACAGGC AAGTAA
|
Protein sequence | MLTFIELLIG VVVIVGVARY IIKGYSATGV LFVGGLLLLI ISAIMGHKVL PSSQASTGYS ATDIVEYVKI LLMSRGGDLG MMIMMLCGFA AYMTHIGAND MVVKLASKPL QYINSPYLLM IAAYCVACLM SLAVSSATGL GVLLMATLFP VMVNVGISRG AAAAICASPA AIILAPTSGD VVLAAQASEM SLIDFAFKTT LPISIAAIIG MAIAHFFWQR YLDKKEHISH EMLDVSEITT TAPAFYAILP FTPIIGVLIF DGKWGPQLHI ITILVICMLI ASILEFLRSF NTQKVFSGLE VAYRGMADAF ANVVMLLVAA GVFAQGLSTI GFIQSLISIA TSFGSASIIL MLVLVILTML AAVTTGSGNA PFYAFVEMIP KLAHSSGINP AYLTIPMLQA SNLGRTLSPV SGVVVAVAGM AKISPFEVVK RTSVPVLVGL VIVIVATELM VPGTAAAVTG K
|
| |