Gene EcE24377A_0644 details

Gene Information

Locus tag: EcE24377A_0644
Symbol: dcuC
ID: 5586869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 677256
End bp: 678641
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 51%
IMG OID: 640924360
Product: C4-dicarboxylate transporter DcuC
Protein accession: YP_001461786
Protein GI: 157155488
COG category: [C] Energy production and conversion
COG ID: [COG3069] C4-dicarboxylate transporter
TIGRFAM ID: [TIGR00771] c4-dicarboxylate anaerobic carrier family protein
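
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 677256
END_BP = 678641
GENE_LENGTH_BP = 1386
PROTEIN_LENGTH_AA = 461


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming a complete, unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1386
print(expected_protein_length(GENE_LENGTH_BP))  # 461
```

Both results match the Gene Length and Protein Length fields listed above.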


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0173361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACAT TCATTGAACT CCTTATTGGG GTTGTGGTTA TTGTGGGTGT AGCTCGCTAC 
ATCATTAAAG GGTATTCCGC CACTGGTGTG TTATTTGTCG GTGGCCTGTT ATTGCTGATT
ATCAGTGCCA TTATGGGGCA CAAAGTGTTA CCGTCCAGCC AGGCTTCAAC AGGCTACAGC
GCCACGGATA TCGTTGAATA CGTTAAAATA TTACTAATGA GCCGCGGCGG CGACCTCGGC
ATGATGATTA TGATGCTGTG TGGATTTGCC GCTTACATGA CCCATATCGG CGCGAATGAT
ATGGTGGTCA AGCTGGCGTC AAAACCATTG CAGTATATTA ACTCCCCTTA CCTGCTGATG
ATTGCCGCCT ATTTTGTCGC CTGTCTGATG TCTCTGGCCG TCTCTTCCGC AACCGGTCTG
GGTGTTTTGC TGATGGCAAC CCTATTTCCG GTGATGGTAA ACGTTGGTAT CAGTCGTGGC
GCAGCTGCTG CCATTTGTGC CTCCCCGGCG GCGATTATTC TCGCACCGAC TTCAGGGGAT
GTGGTGCTGG CGGCGCAAGC TTCCGAAATG TCGCTGATTG ACTTCGCCTT CAAAACGACG
CTGCCTATCT CAATTGCTGC AATTATCGGC ATGGCGATCG CCCACTTCTT CTGGCAACGT
TATCTGGATA AAAAAGAGCA CATCTCTCAT GAAATGTTAG ATGTCAGTGA AATCACCACC
ACTGCTCCTG CGTTTTATGC CATTTTGCCG TTCACGCCGA TCATCGGTGT ACTAATTTTT
GACGGTAAAT GGGGTCCGCA ATTACACATC ATCACTATTC TGGTGATTTG TATGCTGATT
GCCTCCATTC TGGAGTTCCT CCGCAGCTTT AATACCCAGA AAGTTTTCTC TGGTCTGGAA
GTGGCTTATC GCGGAATGGC AGATGCGTTT GCTAACGTGG TGATGCTGCT GGTTGCCGCT
GGGGTATTCG CTCAGGGGCT TAGCACCATC GGCTTTATTC AAAGTCTGAT TTCTATCGCT
ACCTCGTTTG GTTCGGCGAG TATCATCCTG ATGCTGGTAT TGGTGATTCT GACCATGCTG
GCGGCAGTCA CGACCGGTTC AGGCAATGCG CCGTTTTATG CGTTTGTTGA GATGATCCCG
AAACTGGCGC ACTCTTCCGG CATTAACCCG GCGTATTTGA CTATCCCGAT GCTGCAGGCG
TCAAACCTGG GCCGTACCCT TTCGCCCGTT TCTGGCGTAG TCGTTGCGGT TGCCGGGATG
GCGAAGATCT CGCCGTTTGA AGTCGTAAAA CGCACCTCGG TACCGGTGCT TGTTGGTTTG
GTGATTGTTA TCGTTGCTAC AGAGCTGATG GTGCCAGGAA CGGCAGCAGC GGTCACAGGC
AAGTAA
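
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how a GC percentage like the one reported in this record could be recomputed from such text. It assumes GC content means the share of G and C bases among all bases; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the sequence (whitespace ignored)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGCTGACAT TCATTGAACT CCTTATTGGG GTTGTGGTTA TTGTGGGTGT AGCTCGCTAC"

print(len(clean_sequence(first_line)))   # 60
print(round(gc_percent(first_line), 1))  # GC percentage of these 60 bases
```

Passing the complete gene sequence to `gc_percent` gives the value to compare against the GC content field.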
 
Protein sequence
MLTFIELLIG VVVIVGVARY IIKGYSATGV LFVGGLLLLI ISAIMGHKVL PSSQASTGYS 
ATDIVEYVKI LLMSRGGDLG MMIMMLCGFA AYMTHIGAND MVVKLASKPL QYINSPYLLM
IAAYFVACLM SLAVSSATGL GVLLMATLFP VMVNVGISRG AAAAICASPA AIILAPTSGD
VVLAAQASEM SLIDFAFKTT LPISIAAIIG MAIAHFFWQR YLDKKEHISH EMLDVSEITT
TAPAFYAILP FTPIIGVLIF DGKWGPQLHI ITILVICMLI ASILEFLRSF NTQKVFSGLE
VAYRGMADAF ANVVMLLVAA GVFAQGLSTI GFIQSLISIA TSFGSASIIL MLVLVILTML
AAVTTGSGNA PFYAFVEMIP KLAHSSGINP AYLTIPMLQA SNLGRTLSPV SGVVVAVAGM
AKISPFEVVK RTSVPVLVGL VIVIVATELM VPGTAAAVTG K
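
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table by hand. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons), which is sufficient here because this gene begins with ATG. `translate` is an illustrative helper, not a database function.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. A codon containing anything other than
    A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCTGACAT TCATTGAACT CCTTATTGGG GTTGTGGTTA TTGTGGGTGT AGCTCGCTAC"
print(translate(first_line))  # MLTFIELLIGVVVIVGVARY
```

The output matches the first 20 residues of the protein sequence listed above.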