Gene EcSMS35_3522 details

Gene Information

Locus tag: EcSMS35_3522
Symbol:
ID: 6144037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3601794
End bp: 3603161
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 51%
IMG OID: 641618351
Product: putative cryptic C4-dicarboxylate transporter DcuD
Protein accession: YP_001745498
Protein GI: 170681874
COG category: [C] Energy production and conversion
COG ID: [COG3069] C4-dicarboxylate transporter
TIGRFAM ID: [TIGR00771] c4-dicarboxylate anaerobic carrier family protein
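
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3601794
END_BP = 3603161
GENE_LENGTH_BP = 1368
PROTEIN_LENGTH_AA = 455


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the values listed in the record.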


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00146557
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGCA TAATTATATC TGTCATCGTA TTAATTACGA TGGGCTATTT GATCCTGAAA 
AACTACAAAC CTCAGGTGGT GCTGGCTGCC GCAGGTATCT TCCTGATGAT GTGCGGTGTC
TGGTTAGGGT TCGGTGGTGT GCTCGATCCC GCCAAAAGCA GCGGCTACTT GATCGTCGAT
ATTTATAATG AAATCCTGCG CATGCTGTCC AACCGCATTG CCGGATTGGG GCTGTCGATT
ATGGCAGTGG GCGGTTATGC CCGCTATATG GAGCGCATAG GGGCAAGTCG CGCGATGGTG
AGCTTGTTAA GCCGCCCGTT AAAACTCATT CGCTCGCCGT ATATTATTCT GTCGGCAACT
TACGTCATCG GCCAAATCAT GGCGCAGTTT ATTACCAGCG CCTCCGGTCT GGGTATGTTG
CTGATGGTCA CCTTATTTCC GACGCTGGTG AGTCTGGGAG TAAGTCGTCT CTCTGCGGTG
GCGGTTATCG CAACCACCAT GTCCATTGAG TGGGGGATTC TGGAAACGAA CTCCATTTTT
GCTGCCCAGG TCGCAGGAAT GAAAATTGCC ACTTACTTCT TCCACTACCA GCTTCCGGTC
GCCTCTTGCG TCATTATCTC GGTGGCGATC TCCCACTTTT TCGTGCAACG CGCTTTCGAC
AAAAAAGATA AACATATCAA TCACGAACAG GCAGAGCAAA AAGCTCTCGA TAATGTCCCG
CCGCTCTATT ACGCCATTTT ACCGGTGATG CCGTTAATCC TGATGCTCGG CTCGCTGTTC
CTTGCCCACG TCGGGCTGAT GCAGTCAGAA CTGCATCTGG TGGTGGTGAT GTTACTGAGT
TTGACGGTGA CGATGTTTGT TGAGTTCTTC CGCAAGCATA ACTTGCGCGA AACAATGGAC
GATGTGCAGG CGTTTTTTGA CGGCATGGGT ACGCAGTTTG CCAACGTGGT AACGCTGGTG
GTCGCGGGTG AAATATTTGC GAAAGGCTTA ACGACGATTG GCACTGTCGA TGCGGTTATC
AGGGGGGCGG AGCATTCTGG TCTGGGCGGT ATTGGCGTGA TGATTATTAT GGCGCTGGTC
ATTGCCATTT GTGCCATTGT GATGGGCTCA GGCAATGCGC CGTTTATGTC ATTTGCCAGT
CTTATTCCGA ATATCGCAGC CGGACTACAT GTACCAGCGG TTGTAATGAT TATGCCAATG
CATTTTGCCA CGACGCTAGC GCGCGCTGTT TCGCCGATTA CTGCGGTGGT GGTCGTTACG
TCAGGAATTG CAGGCGTTTC GCCTTTTGCG GTAGTGAAGC GGACAGCGAT CCCCATGGCA
GTCGGTTTCG TGGTGAATAT GATTGCCACT ATCACGCTAT TTTATTAA
 
Protein sequence
MFGIIISVIV LITMGYLILK NYKPQVVLAA AGIFLMMCGV WLGFGGVLDP AKSSGYLIVD 
IYNEILRMLS NRIAGLGLSI MAVGGYARYM ERIGASRAMV SLLSRPLKLI RSPYIILSAT
YVIGQIMAQF ITSASGLGML LMVTLFPTLV SLGVSRLSAV AVIATTMSIE WGILETNSIF
AAQVAGMKIA TYFFHYQLPV ASCVIISVAI SHFFVQRAFD KKDKHINHEQ AEQKALDNVP
PLYYAILPVM PLILMLGSLF LAHVGLMQSE LHLVVVMLLS LTVTMFVEFF RKHNLRETMD
DVQAFFDGMG TQFANVVTLV VAGEIFAKGL TTIGTVDAVI RGAEHSGLGG IGVMIIMALV
IAICAIVMGS GNAPFMSFAS LIPNIAAGLH VPAVVMIMPM HFATTLARAV SPITAVVVVT
SGIAGVSPFA VVKRTAIPMA VGFVVNMIAT ITLFY
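
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, not the method used to produce this record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so the sketch uses the standard assignments and does no special start-codon handling (the gene here begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as displayed in this record.
    first_line = "ATGTTCGGCA TAATTATATC TGTCATCGTA TTAATTACGA TGGGCTATTT GATCCTGAAA"
    print(translate(first_line))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check for the record.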