Gene Information |
Locus tag | SbBS512_E0536 |
Symbol | dcuC |
ID | 6268596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 513612 |
End bp | 514997 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641724745 |
Product | C4-dicarboxylate transporter DcuC |
Protein accession | YP_001879292 |
Protein GI | 187733524 |
COG category | [C] Energy production and conversion |
COG ID | [COG3069] C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00771] c4-dicarboxylate anaerobic carrier family protein |
|
|
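The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 513612
end_bp = 514997
gene_length_bp = 1386
protein_length_aa = 461

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and does not encode an amino acid.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1386 0 461
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus one matches the listed protein length.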
Plasmid Coverage Information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACAT TCATTGAACT CCTTATTGGG GTTGTGGTTA TTGTGGGTGT AGCTCGCTAC ATCATTAAAG GGTATTCCGC CACTGGTGTG TTATTTGTCG GTGGCCTGTT ATTGCTGATT ATCAGTGCCA TTATGGGGCA CAAAGTGTTA CCGTCCAGCC AGGCTTCAAC AGGCTACAGC GCCACGGATA TCGTTGAATA CGTTAAAATA TTACTAATGA GCCGCGGCGG CGACCTCAGC ATGATGATTA TGATGCTGTG TGGATTTGCC GCTTACATGA CCCATATCGG CGCGAATGAT ATGGTGGTCA AGCTGGCGTC AAAACCATTG CAGTATATTA ACTCCCCTTA CCTGCTGATG ATTGCCGCCT ATTTTGTCGC CTGTCTGATG TCTCTAGCCG TCTCTTCCGC AACCGGTCTG GGTGTTTTGC TGATGGCAAC CCTATTTCCG GTGATGGTAA ACGTTGGTAT CAGTCGTGGC GCAGCTGCTG CCATTTGTGC CTCCCCGGCG GCGATTATTC TCGCACCGAC TTCAGGGGAT GTGGTGCTGG CGGCGCAAGC TTCCGAAATG TCGCTGATTG ACTTCGCCTT CAAAACGACG CTGCCTATCT CAATTGCTGC AATTATCGGC ATGGCGATCG CCCACTTCTT CTGGCAACGT TATCTGGATA AAAAAGAGCA CATCTCTCAT GAAATGTTAG ATGTCAGTGA AATCACCACC ACTGCTCCTG CGTTTTATGC CATTTTGCCG TTCACGCCGA TCATCGGTGT ACTGATTTTT GACGGTAAAT GGGGTCCGCA ATTACACATC ATCACTATTC TGGTGATTTG TATGCTGATT GCCTCCATTC TGGAGTTCCT CCGCAGCTTT AATACCCAGA AAGTTTTCTC TGGTCTGGAA GTGGCTTATC GCGGGATGGC AGATGCGTTT GCTAACGTGG TGATGCTGCT GGTTGCCGCT GGGGTATTCG CTCAGGGGCT TAGCACCATC GGCTTTATTC AAAGTCTGAT TTCTATCGCT ACCTCGTTTG GTTCGGCGAG TATCATCCTG ATGCTGGTAT TGGCGATTCT GACAATGCTG GCGGCAGTCA CGACCGGTTC AGGCAATGCG CCGTTTTATG CGTTTGTTGA GATGATCCCG AAACTGGCGC ACTCTTCCGG CATTAACCCG GCGTATTTGA CTATCCCGAT GCTGCAGGCG TCAAACCTTG GCCGTACCCT TTCGCCCGTT TCTGGCGTAG TCGTTGCGGT TGCCGGGATG GCGAAGATCT CGCCGTTTGA AGTCGTAAAA CGCACCTCGG TACCGGTGCT TGTTGGTTTG GTGATTGTTA TCGTTGCTAC AGAGCTGATG GTGCCAGGAA CGGCAGCAGC GGTCACAGGC AAGTAA
|
Protein sequence | MLTFIELLIG VVVIVGVARY IIKGYSATGV LFVGGLLLLI ISAIMGHKVL PSSQASTGYS ATDIVEYVKI LLMSRGGDLS MMIMMLCGFA AYMTHIGAND MVVKLASKPL QYINSPYLLM IAAYFVACLM SLAVSSATGL GVLLMATLFP VMVNVGISRG AAAAICASPA AIILAPTSGD VVLAAQASEM SLIDFAFKTT LPISIAAIIG MAIAHFFWQR YLDKKEHISH EMLDVSEITT TAPAFYAILP FTPIIGVLIF DGKWGPQLHI ITILVICMLI ASILEFLRSF NTQKVFSGLE VAYRGMADAF ANVVMLLVAA GVFAQGLSTI GFIQSLISIA TSFGSASIIL MLVLAILTML AAVTTGSGNA PFYAFVEMIP KLAHSSGINP AYLTIPMLQA SNLGRTLSPV SGVVVAVAGM AKISPFEVVK RTSVPVLVGL VIVIVATELM VPGTAAAVTG K