Gene EcSMS35_2105 details


Gene Information

Locus tag: EcSMS35_2105
Symbol:
ID: 6144740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2113758
End bp: 2114885
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 51%
IMG OID: 641616981
Product: hypothetical protein
Protein accession: YP_001744156
Protein GI: 170682015
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2822] Predicted periplasmic lipoprotein involved in iron transport
TIGRFAM ID:
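
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one stop codon in addition to the codons of the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2113758
end_bp = 2114885
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
expected_gene_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)           # 1128
print(expected_gene_length_bp)  # 1128
```

Both values agree with the reported gene length of 1128 bp.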


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.258925
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTA ACTTCCGCCG TAACGCATTG CAGTTGAGCG TGGCTGCGCT GTTCTCTTCT 
GCTTTTATGG CTAACGCCGC TGATATTCCG CAAGTCAAAG TGACCGTAAC GGATAAGCAG
TGCGAACCGA TGACCATTAC GGTTAACGCC GGGAAAACAC AGTTCATTAT TCAGAACCAC
AGCCAGAAGG CGCTGGAGTG GGAGATCCTG AAAGGCGTGA TGGTGGTGGA AGAGCGGGAA
AATATCGCCC CTGGCTTTAG CCAGAAAATG ACGGCGAATT TACAGCCTGG CGAATACGAT
ATGACCTGCG GTCTGCTGAC TAACCCGAAA GGGAAGTTGA TCGTCAAAGG TGAGGCAACG
GCGGATGCGG CGCAAAGTGA TGCGCTGTTA AGTCTTGGTG GTGCAATTAC TGCATATAAA
GCGTATGTCA TGGCGGAAAC CACGCAGCTG GTGACCGACA CCAAAGCCTT TACCGACGCG
ATTAAAGCAG GCGATATCGA AAAAGCGAAA GCACTGTATG CGCCGACGCG CCAGCACTAT
GAGCGCATTG AACCGATTGC TGAACTGTTC TCCGATCTGG ATGGCAGCAT TGACGCCCGT
GAAGATGATT ACGAGCAAAA AGCCGCCGAT CCAAAATTCA CCGGTTTCCA CCGTCTGGAA
AAAGCATTGT TTGGCGACAA CACCACCAAA GGGATGGATA AGTACGCTGA CCAGCTTTAT
ACCGATGTGG TCGATTTGCA AAAACGCATC AGTGAACTGG CTTTCCCACC TTCAAAAGTG
GTCGGCGGTG CAGCCGGACT GATTGAGGAA GTGGCAGCCA GCAAAATCAG CGGTGAAGAA
GATCGCTACA GCCACACCGA TCTGTGGGAT TTCCAGGCTA ACGTTGAAGG CTCGCAGAAA
ATTGTCGATC TGCTGCGTCC ACAACTACAA AAAGCGAATC CGGAACTGCT GGCAAAAGTC
GATGCCAATT TCAAAAAGGT CGATACCATT CTGGCGAAAT ACCGTACTAA AGACGGTTTT
GAAACCTACG ACAAATTGAC CGATGCTGAC CGTAATGCGC TGAAAGGGCC GATTACTGCG
CTGGCGGAAG ATCTGGCGCA ACTTCGCGGT GTGCTGGGGC TGGACTAA
 
Protein sequence
MTINFRRNAL QLSVAALFSS AFMANAADIP QVKVTVTDKQ CEPMTITVNA GKTQFIIQNH 
SQKALEWEIL KGVMVVEERE NIAPGFSQKM TANLQPGEYD MTCGLLTNPK GKLIVKGEAT
ADAAQSDALL SLGGAITAYK AYVMAETTQL VTDTKAFTDA IKAGDIEKAK ALYAPTRQHY
ERIEPIAELF SDLDGSIDAR EDDYEQKAAD PKFTGFHRLE KALFGDNTTK GMDKYADQLY
TDVVDLQKRI SELAFPPSKV VGGAAGLIEE VAASKISGEE DRYSHTDLWD FQANVEGSQK
IVDLLRPQLQ KANPELLAKV DANFKKVDTI LAKYRTKDGF ETYDKLTDAD RNALKGPITA
LAEDLAQLRG VLGLD
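
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation; it differs in the set of permitted start codons, which this sketch does not model (the gene here starts with ATG, so that makes no difference).

```python
from itertools import product

# Codon table built in the conventional TCAG order; the amino-acid string
# lists the standard assignments, which table 11 shares for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGACCATTA ACTTCCGCCG TAACGCATTG"))  # MTINFRRNAL
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content.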