Gene EcSMS35_2764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2764 
Symbol 
ID6145375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2843656 
End bp2844918 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content51% 
IMG OID641617634 
Producthypothetical protein 
Protein accessionYP_001744795 
Protein GI170681983 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4536] Putative Mg2+ and Co2+ transporter CorB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000294524 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.119885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCATTA TTCTGATCAT CATGGTGGTC ATTTCAGCCT ATTTTTCCGG GTCCGAAACC 
GGAATGATGA CCCTCAACCG CTATCGTCTG CGACATATGG CGAAACAGGG TAATCGCTCG
GCCAAACGCG TCGAAAAATT GCTGCGTAAG CCAGACCGCC TGATTAGCCT GGTGTTAATC
GGCAATAACC TGGTCAATAT TCTTGCCTCC GCGCTAGGCA CTATTGTTGG GATGCGTTTG
TACGGCGATG CGGGCGTGGC AATTGCGACT GGTGTGCTGA CTTTTGTCGT GCTGGTTTTT
GCTGAGGTAT TGCCGAAAAC CATTGCCGCG CTGTACCCGG AAAAAGTCGC TTATCCGAGT
AGTTTTCTGC TGGCTCCGCT GCAAATTTTG ATGATGCCGC TGGTCTGGTT GCTGAATGCT
ATCACCCGTA TGCTGATGCG CATGATGGGT ATCAAAACCG ATATCGTGGT TAGCGGCTCT
TTGAGCAAAG AAGAGTTGCG CACTATCGTG CACGAATCGC GCTCACAAAT TTCCCGTCGC
AATCAGGATA TGCTGCTGTC GGTGCTCGAT CTGGAAAAAA TGACCGTTGA TGACATCATG
GTGCCGCGCA GTGAAATTAT CGGTATTGAT ATCAACGATG ACTGGAAATC GATTCTGCGC
CAACTCTCCC ACTCACCTCA CGGGCGCATC GTGCTCTACC GTGATTCGCT GGACGACGCC
ATCAGTATGC TGCGTGTGCG TGAAGCCTGG CGACTCATGT CGGAGAAAAA AGAGTTCACC
AAAGAAACCA TGCTACGCGC CGCGGACGAG ATCTATTTTG TCCCGGAAGG TACGCCGCTC
AGCACGCAGT TGGTGAAGTT TCAGCGCAAC AAAAAGAAAG TCGGCCTGGT CGTCAACGAG
TATGGAGACA TTCAGGGGCT GGTGACGGTT GAAGATATTC TGGAAGAGAT TGTCGGCGAT
TTCACCACGT CGATGTCGCC AACACTTGCC GAAGAAGTTA CGCCACAAAA CGACGGTTCG
GTGATTATTG ATGGCACTGC CAACGTGCGA GAAATTAACA AAGCCTTTAA CTGGCATCTA
CCGGAAGATG ATGCTCGTAC TGTTAACGGC GTCATTCTGG AAGCGCTGGA GGAGATCCCG
GTAGCAGGCA CCCGCGTGCG TATTGGCGAG TACGATATTG ATATTCTCGA CGTTCAGGAC
AATATGATTA AGCAGGTAAA AGTTTTTCCT GTGAAACCGC TACGCGAGAG CGTGGCGGAG
TAA
 
Protein sequence
MIIILIIMVV ISAYFSGSET GMMTLNRYRL RHMAKQGNRS AKRVEKLLRK PDRLISLVLI 
GNNLVNILAS ALGTIVGMRL YGDAGVAIAT GVLTFVVLVF AEVLPKTIAA LYPEKVAYPS
SFLLAPLQIL MMPLVWLLNA ITRMLMRMMG IKTDIVVSGS LSKEELRTIV HESRSQISRR
NQDMLLSVLD LEKMTVDDIM VPRSEIIGID INDDWKSILR QLSHSPHGRI VLYRDSLDDA
ISMLRVREAW RLMSEKKEFT KETMLRAADE IYFVPEGTPL STQLVKFQRN KKKVGLVVNE
YGDIQGLVTV EDILEEIVGD FTTSMSPTLA EEVTPQNDGS VIIDGTANVR EINKAFNWHL
PEDDARTVNG VILEALEEIP VAGTRVRIGE YDIDILDVQD NMIKQVKVFP VKPLRESVAE