Gene EcSMS35_0640 details

Gene Information

Locus tag: EcSMS35_0640
Symbol: dcuC
ID: 6144126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 653982
End bp: 655367
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 51%
IMG OID: 641615532
Product: C4-dicarboxylate transporter DcuC
Protein accession: YP_001742738
Protein GI: 170680591
COG category: [C] Energy production and conversion
COG ID: [COG3069] C4-dicarboxylate transporter
TIGRFAM ID: [TIGR00771] c4-dicarboxylate anaerobic carrier family protein
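
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record does end in TAA). The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 653982
end_bp = 655367
gene_length_bp = 1386
protein_length_aa = 461


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions, 655367 - 653982 + 1 gives 1386 bp, and 1386 / 3 - 1 gives 461 aa, which agrees with the listed gene and protein lengths.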


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.907894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACAT TCATTGAACT CCTTATTGGG GTTGTGGTTA TTGTGGGTGT AGCTCGCTAC 
ATCATTAAAG GGTATTCCGC CACTGGCGTG TTATTTGTCG GTGGCCTGTT ATTGCTGATT
ATCAGTGCCA TTATGGGGCA CAAAGTGTTA CCGTCCAGCC AGGCTTCAAC AGGCTACAGC
GCCACGGATA TCGTTGAATA CGTTAAAATA TTGCTAATGA GCCGCGGCGG CGACCTCGGC
ATGATGATTA TGATGCTGTG TGGCTTTGCC GCTTACATGA CCCATATCGG CGCGAATGAT
ATGGTGGTCA AGCTGGCGTC AAAACCATTG CAGTATATTA ACTCCCCTTA CCTGCTGATG
ATTGCCGCCT ATTTTGTCGC CTGTCTGATG TCTCTGGCCG TCTCTTCCGC AACCGGTCTG
GGTGTTTTGC TGATGGCAAC CCTGTTTCCG GTGATGGTAA ACGTTGGTAT CAGCCGTGGT
GCAGCTGCTG CTATTTGTGC CTCCCCGGCG GCGATTATTC TCGCACCGAC TTCAGGGGAT
GTGGTGCTGG CGGCGCAGGC TTCCGAAATG TCGCTGATTG ACTTCGCCTT CAAAACGACG
CTGCCTATCT CAATTGCTGC AATTATCGGC ATGGCGATCG CCCACTTCTT CTGGCAACGT
TATCTGGATA AAAAAGAGCA CATCTCTCAT GAAATGTTAG ATGTCAGTGA AATCACCACT
ACTGCTCCTG CGTTTTATGC CATTTTGCCG TTCACGCCGA TCATCGGAGT GCTGATTTTT
GACGGCAAAT GGGGTCCGCA ATTACACATC ATCACTATTC TGGTGATTTG TATGCTGATT
GCCTCCATTC TGGAGTTCAT CCGCAGCTTT AATACCCAGA AAGTTTTCTC TGGTCTGGAA
GTGGCTTATC GCGGGATGGC CGATGCGTTT GCTAACGTGG TGATGCTGCT GGTTGCCGCT
GGGGTATTCG CTCAGGGGCT TAGCACCATC GGCTTTATTC AAAGTCTGAT TTCTATCGCC
ACCTCGTTTG GTTCGGCGAG TATCATCCTG ATGCTGGTAT TAGTGATCCT GACAATGCTG
GCGGCAGTCA CGACCGGTTC AGGCAATGCG CCGTTTTATG CGTTTGTTGA GATGATCCCG
AAACTGGCGC ACTCTTCCGG CATTAACCCG GCGTATTTGA CTATCCCAAT GCTGCAGGCG
TCAAACCTCG GCCGTACCCT GTCACCCGTT TCTGGCGTAG TCGTTGCGGT TGCCGGGATG
GCGAAAATCT CACCATTTGA AGTCGTAAAA CGCACCTCGG TGCCGGTGCT TGTTGGGCTG
GTGATTGTTA TCGTTGCTAC AGAGCTGATG GTGCCAGGAA CGGCAGCCGC GGTCACAGGC
AAGTAA
 
Protein sequence
MLTFIELLIG VVVIVGVARY IIKGYSATGV LFVGGLLLLI ISAIMGHKVL PSSQASTGYS 
ATDIVEYVKI LLMSRGGDLG MMIMMLCGFA AYMTHIGAND MVVKLASKPL QYINSPYLLM
IAAYFVACLM SLAVSSATGL GVLLMATLFP VMVNVGISRG AAAAICASPA AIILAPTSGD
VVLAAQASEM SLIDFAFKTT LPISIAAIIG MAIAHFFWQR YLDKKEHISH EMLDVSEITT
TAPAFYAILP FTPIIGVLIF DGKWGPQLHI ITILVICMLI ASILEFIRSF NTQKVFSGLE
VAYRGMADAF ANVVMLLVAA GVFAQGLSTI GFIQSLISIA TSFGSASIIL MLVLVILTML
AAVTTGSGNA PFYAFVEMIP KLAHSSGINP AYLTIPMLQA SNLGRTLSPV SGVVVAVAGM
AKISPFEVVK RTSVPVLVGL VIVIVATELM VPGTAAAVTG K
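
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal, dependency-free illustration rather than the database's own tooling. It builds the codon table from the amino-acid string that NCBI publishes for table 11 (whose amino-acid assignments match the standard code), strips the spacing used in the sequence blocks, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Alternative start codons are NOT converted to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGCTGACAT TCATTGAACT CCTTATTGGG"))  # MLTFIELLIG

# Final six bases: one lysine codon followed by the TAA stop codon.
print(translate("AAGTAA"))  # K
```

The first ten residues and the terminal lysine match the start and end of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 461 residues.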