Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0668 |
Symbol | dcuC |
ID | 6796833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 665685 |
End bp | 667070 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642774946 |
Product | C4-dicarboxylate transporter DcuC |
Protein accession | YP_002145601 |
Protein GI | 197250683 |
COG category | [C] Energy production and conversion |
COG ID | [COG3069] C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00771] c4-dicarboxylate anaerobic carrier family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0313164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAACAG TTATAGAGCT CCTTATCGGA GTCGTCGTTA TTGTGGGTGT AGCGCGCTAC ATCATTAAGG GATATTCCGC CACTGGCGTT TTATTTGTCG GCGGTCTGGT TCTACTTATT ATCAGCGCGC TGATGGGGCA TAAGGTATTA CCTGCCAGCG AAACCAGTAC CGGCTATACC GCAACAGATA TTGTTGAATA CATCAAAATT TTGCTTATGA GTCGCGGCGG CGATCTTGGC ATGATGATCA TGATGCTGTG CGGCTTTGCC GCCTATATGA CGCATATCGG CGCGAATGAT ATGGTCGTGA AGCTGGCGTC GAAACCTTTA CAGTACATAA ACTCGCCCTA CCTGTTGATG ATTGCTGCCT ATTTTGTCGC CTGCCTGATG TCGCTGGCCG TCTCTTCCGC GACGGGGCTT GGCGTACTGC TCATGGCTAC TCTCTTCCCG GTGATGGTCA ACGTCGGCAT TAGCCGCGGC GCGGCGGCGG CTATTTGCGC CTCTCCGGCC GCCATTATCC TCTCGCCAAC GTCTGGCGAT GTGGTTTTAG CCGCAAAAGC GGCGGAGATG CCGTTAATCG ATTTCGCGTT CAAAACCACG CTGCCAATTT CCATCGCCGC CATTATCGGT ATGGCGATTG CGCACTTTTT CTGGCAACGC TATCTGGATA AAAAAGAGAA CATTTCACAT GAGATGCTGG ACGTGGCGGA GATTACCACC ACAGCCCCGG CGTTTTATGC CCTTTTACCG TTCACGCCCA TTATCGGCGT ACTGATTTTT GACGGTAAAT GGGGGCCGCA GTTGCACATT ATTACCATTC TGGTGATCTG TATGCTGCTG GCCGCCGTGC TGGAATTCGT GCGCGGCTTC AACACGCAAA ATGTGTTTTC CGGCCTGGAA GTGGCTTATC GCGGCATGGC GGATGCGTTT GCCGGCGTTG TGATGCTGCT GGTCGCGGCG GGCGTTTTCG CCCAGGGGCT GAGCACCATT GGCTTTATCC AAAGTCTGAT CTCGATTGCC ACTTCTTTTG GCTCCGCCAG CATTATTCTG ATGCTGGTCT TAGTGATTTT AACCATGCTG GCCGCGATGA CCACCGGTTC AGGCAATGCG CCTTTTTACG CCTTTGTTGA GATGATCCCT AAACTGGCGC ACTCCTCCGG TATCAACCCA GCCTATTTAT CCATCCCAAT GCTGCAAGCC TCAAACCTGG GGCGCACGAT TTCGCCGGTC TCCGGCGTCG TGGTCGCTGT CGCCGGGATG GCTAAAATAT CACCGTTTGA AGTGGTGAAA CGCACGTCCG TTCCGGTCAT CGTCGGTCTG TTGATCGTCA TTATCGCCAC GGAAATTATG GTGCCGGGCG CCTCTTCCGC CGTTACTGAC GGCTGA
|
Protein sequence | MLTVIELLIG VVVIVGVARY IIKGYSATGV LFVGGLVLLI ISALMGHKVL PASETSTGYT ATDIVEYIKI LLMSRGGDLG MMIMMLCGFA AYMTHIGAND MVVKLASKPL QYINSPYLLM IAAYFVACLM SLAVSSATGL GVLLMATLFP VMVNVGISRG AAAAICASPA AIILSPTSGD VVLAAKAAEM PLIDFAFKTT LPISIAAIIG MAIAHFFWQR YLDKKENISH EMLDVAEITT TAPAFYALLP FTPIIGVLIF DGKWGPQLHI ITILVICMLL AAVLEFVRGF NTQNVFSGLE VAYRGMADAF AGVVMLLVAA GVFAQGLSTI GFIQSLISIA TSFGSASIIL MLVLVILTML AAMTTGSGNA PFYAFVEMIP KLAHSSGINP AYLSIPMLQA SNLGRTISPV SGVVVAVAGM AKISPFEVVK RTSVPVIVGL LIVIIATEIM VPGASSAVTD G
|
| |