Gene EcE24377A_3709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3709 
Symbol 
ID5588536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3704999 
End bp3706321 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content51% 
IMG OID640927332 
Productputative cryptic C4-dicarboxylate transporter DcuD 
Protein accessionYP_001464699 
Protein GI157155112 
COG category[C] Energy production and conversion 
COG ID[COG3069] C4-dicarboxylate transporter 
TIGRFAM ID[TIGR00771] c4-dicarboxylate anaerobic carrier family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000109992 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGGCA TAATTATATC TGTCATCGTA TTAATTACGA TGGGCTATTT GATCCTGAAA 
AACTACAAAC CTCAGGTGGT GCTGGCTGCC GCAGGTATCT TCCTGATGAT GTGCGGTGTC
TGGTTAGGGT TCGGTGGTGT ACTCGATCCC GCCAAAAGCA GCGGCTACTT GATCGTCGAT
ATTTATAATG AAATCCTGCG CATGCTGTCC AACCGCATTG CCGGATTGGG GCTGTCGATT
ATGGCGGTGG GCGGTTATGC CCGCTACATG GAGCGCACAG GAGCCAGTCG CGCGATGGTG
AGCTTGCTAA GCCGCCCGTT AAAACTCATT CGCTCGCCGT ATATTATTCT ATCGGCAACT
TACGTCATCG GCCAAATCAT GGCGCAGTTT ATTACCAGCG CCTCCGGCCT GGGTATGTTG
CTGATGGTCA CCTTATTTCC GACGCTGGTG AGTCTGGGAG TAAGTCGCCT CTCTGCGGTG
GCGGTTATCG CAACCACGAT GTCCATTGAG TGGGGGATTC TGGAAACGAA CTCCATTTTT
GCAGCCCAGG TCGCGGGAAT GAAAATTGCC ACTTACTTCT TCCACTACCA GCTTCCGGTC
GCCTCTTGCG TCATTATCTC GGTGGCGATC TCCCACTTTT TCGTGCAACG CGCTTTTGAC
AAAAAAGATA AAAATATCAA TCACGAACAG GCAGAGCTAA AAGCTCTCGA TAATGTCCCG
CCGCTCTATT ACGCCATTTT ACCTGTGATG CCGTTAATCT TGATGCTCGG CTCGCTGTTC
CTCGCCCACA TCGGGCTGAT GCAGTCAGAA CTGCATCTGG TGGTGGTGAT GTTACTGAGT
TTGACGGTGA CGATGTTTGT TGAGTTCTTC CGCAAGCATA ACTTGCGCGA AACAATGGAC
GATGTGCAGG CGTTTTTTGA CGGCATGGGT ACGCAGTTTG CCAACGTGGT AACGCTGGTG
GTCGCGGGTG AAATATTTGC GAAAGGCTTA ACGACGATTG GCACTGTTGA TGCGGTTATC
AGGGGTGCGG AGCATTCTGG TCTGGGCGGT ATTGGCGTGA TGATTATTAT GGCGCTAGTC
ATTGCCATTT GTGCCATTGT GATGGGCTCT GGCAATGCGC CGTTTATGTC ATTTGCCAGT
CTTATTCCGA ATATCGCAGC CGGACTACAT GTACCAGCGG TTGTAATGAT TATGCCGATG
CATTTTGCCA CGACGCTAGC GCGCGCGGTT TCGCCGATTA CTGCGGTGGT GAAGCGAACA
GCGATCCCCA TGGCAGTCGG TTTCGTGGTG AATATGATTG CCACAATCAC GCTATTTTAT
TAA
 
Protein sequence
MFGIIISVIV LITMGYLILK NYKPQVVLAA AGIFLMMCGV WLGFGGVLDP AKSSGYLIVD 
IYNEILRMLS NRIAGLGLSI MAVGGYARYM ERTGASRAMV SLLSRPLKLI RSPYIILSAT
YVIGQIMAQF ITSASGLGML LMVTLFPTLV SLGVSRLSAV AVIATTMSIE WGILETNSIF
AAQVAGMKIA TYFFHYQLPV ASCVIISVAI SHFFVQRAFD KKDKNINHEQ AELKALDNVP
PLYYAILPVM PLILMLGSLF LAHIGLMQSE LHLVVVMLLS LTVTMFVEFF RKHNLRETMD
DVQAFFDGMG TQFANVVTLV VAGEIFAKGL TTIGTVDAVI RGAEHSGLGG IGVMIIMALV
IAICAIVMGS GNAPFMSFAS LIPNIAAGLH VPAVVMIMPM HFATTLARAV SPITAVVKRT
AIPMAVGFVV NMIATITLFY