Gene EcSMS35_4048 details

Gene Information

Locus tag: EcSMS35_4048
Symbol:
ID: 6146725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4141377
End bp: 4143038
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 53%
IMG OID: 641618874
Product: hypothetical protein
Protein accession: YP_001746012
Protein GI: 170683682
COG category: [R] General function prediction only
COG ID: [COG2985] Predicted permease
TIGRFAM ID: [TIGR01625] AspT/YidE/YbjL antiporter duplication domain
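
The coordinate and length fields above agree with each other, and a short check makes the arithmetic explicit. The sketch below is an illustration only and not part of the source record: the helper names are hypothetical, and it assumes inclusive 1-based coordinates and a protein length that excludes the terminal stop codon.

```python
# Values copied from the record; the helper names are illustrative only.
START_BP = 4141377
END_BP = 4143038


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1662 553
```

Under those assumptions the result matches the listed Gene Length (1662 bp) and Protein Length (553 aa).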


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.926821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATA TAGCATTAAC GGTCAGTATT CTGGCTTTGG TGGCAGTCGT CGGTTTGTTT 
ATCGGCAACG TCAAATTTCG CGGCATAGGA TTAGGTATTG GCGGCGTGCT GTTTGGTGGG
ATCATCGTCG GCCATTTTGT TTCTCAGGCG GGGATGACAT TAAGTAGCGA TATGCTGCAT
GTTATTCAGG AATTTGGCCT GATCCTGTTC GTTTATACCA TCGGGATTCA GGTAGGGCCG
GGCTTCTTTG CCTCATTGCG CGTCTCCGGA TTACGCCTCA ACCTGTTTGC TGTTCTGATT
GTCATCATCG GGGGTCTGGT TACCGCCATC CTGCATAAAC TGTTTGATAT TCCACTGCCG
GTAGTGCTGG GGATTTTCTC CGGTGCGGTA ACCAATACGC CAGCGCTGGG GGCAGGGCAG
CAGATCTTGC GCGACCTGGG TACACCAATG GAAATGGTCG ATCAGATGGG GATGAGTTAC
GCGATGGCGT ATCCATTCGG TATTTGCGGG ATTTTGTTCA CCATGTGGAT GTTGCGGGTT
ATTTTCCGCG TCAATGTCGA GACAGAAGCC CAGCAGCACG AGTCTTCTCG CACCAATGGC
GGCGCGCTGA TCAGGACTAT CAATATTCGC GTTGAGAACC CTAACCTGCA TGATTTAGCC
ATTAAAGATG TACCGATTCT CAACGGCGAC AAAATTATCT GCTCGCGTCT GAAACGCGAA
GAAACCCTAA AAGTACCTTC GCCAGATACC ATTATCCAAC TGGGCGATTT GCTGCATCTG
GTGGGGCAGC CAGCGGATTT ACATAATGCG CAACTGGTGA TTGGTCAGGA GGTCGATACC
TCGTTGTCCA CGAAAGGCAC TGATTTACGC GTCGAGCGTG TGGTGGTCAC CAATGAAAAC
GTGCTCGGTA AACGTATTCG CGACCTGCAC TTTAAAGAAC GCTATGACGT TGTTATCTCG
CGCCTGAACC GTGCCGGGGT CGAACTGGTC GCCAGTGGCG ATATCAGCCT GCAGTTCGGC
GATATCCTCA ACCTGGTGGG GCGTCCGTCC GCAATTGATG CCGTTGCCAA TGTGCTGGGG
AATGCGCAGC AAAAACTGCA ACAGGTTCAG ATGTTGCCGG TGTTTATTGG TATCGGGCTT
GGCGTATTGT TAGGTTCTAT TCCCGTCTTT GTGCCGGGAT TCCCGGCCGC GTTGAAACTG
GGACTGGCAG GCGGCCCGCT GATTATGGCG TTGATCCTCG GGCGTATCGG CAGTATCGGC
AAGCTGTACT GGTTTATGCC GCCAAGCGCC AACCTCGCGC TGCGGGAGCT GGGGATCGTA
CTGTTCCTCT CGGTAGTGGG GCTGAAATCT GGTGGGGATT TTGTAAATAC CCTGGTCAAT
GGCGAAGGGC TAAGCTGGAT TGGATATGGT GCCCTGATCA CCGCCGTTCC GTTGATTACT
GTTGGTATTC TGGCGCGGAT GTTAGCCAAA ATGAATTACC TGACCATGTG CGGGATGCTG
GCTGGCTCCA TGACCGATCC ACCGGCGCTG GCATTTGCTA ATAATCTTCA TCCAACGAGC
GGTGCAGCGG CGCTCTCTTA CGCCACTGTC TATCCGTTAG TGATGTTCCT GCGCATTATC
ACCCCCCAAT TACTGGCGGT GCTCTTCTGG AGTATCGGTT AA
 
Protein sequence
MSDIALTVSI LALVAVVGLF IGNVKFRGIG LGIGGVLFGG IIVGHFVSQA GMTLSSDMLH 
VIQEFGLILF VYTIGIQVGP GFFASLRVSG LRLNLFAVLI VIIGGLVTAI LHKLFDIPLP
VVLGIFSGAV TNTPALGAGQ QILRDLGTPM EMVDQMGMSY AMAYPFGICG ILFTMWMLRV
IFRVNVETEA QQHESSRTNG GALIRTINIR VENPNLHDLA IKDVPILNGD KIICSRLKRE
ETLKVPSPDT IIQLGDLLHL VGQPADLHNA QLVIGQEVDT SLSTKGTDLR VERVVVTNEN
VLGKRIRDLH FKERYDVVIS RLNRAGVELV ASGDISLQFG DILNLVGRPS AIDAVANVLG
NAQQKLQQVQ MLPVFIGIGL GVLLGSIPVF VPGFPAALKL GLAGGPLIMA LILGRIGSIG
KLYWFMPPSA NLALRELGIV LFLSVVGLKS GGDFVNTLVN GEGLSWIGYG ALITAVPLIT
VGILARMLAK MNYLTMCGML AGSMTDPPAL AFANNLHPTS GAAALSYATV YPLVMFLRII
TPQLLAVLFW SIG
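
The gene and protein sequences are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below is a minimal illustration under stated assumptions, not the database's own tooling: the function names are hypothetical, the codon assignments are those of translation table 11 (which shares its amino acid assignments with the standard code and differs in permitted start codons), and translation is assumed to begin at an ATG and end at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = clean_sequence("ATGAGTGATA TAGCATTAAC GGTCAGTATT")
print(translate(first_blocks))  # prints: MSDIALTVSI
```

The translated prefix matches the first block of the protein sequence listed in the record. Passing the full gene sequence to the same functions would allow the complete protein and the GC content field to be checked in the same way.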