Gene EcSMS35_0523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0523 
Symbolfsr 
ID6145526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp531682 
End bp532902 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content53% 
IMG OID641615417 
Productfosmidomycin resistance protein 
Protein accessionYP_001742624 
Protein GI170682753 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATGA GTGAACAAAC CCAGCCTGTG GCGGGCGCGG CTGCGTCAAC GACCAAGGCC 
CGAACATCGT TTGGTATTTT AGGTGCTATC AGCCTCTCAC ATCTGCTGAA CGACATGATC
CAATCGCTGA TTCTGGCGAT TTATCCGCTG CTTCAGTCAG AGTTTTCTCT GACATTTATG
CAGATTGGCA TGATAACCCT CACCTTCCAG CTCGCCTCTT CGCTACTGCA ACCGGTGGTC
GGTTACTGGA CCGATAAATA TCCGATGCCG TGGTCGTTGC CAATTGGCAT GTGCTTTACC
TTAAGTGGTC TGGTGCTGCT TGCGCTGGCG GGCAGTTTTG GCGCAGTTCT GCTGGCGGCG
GCGCTGGTCG GTACCGGTTC ATCGGTCTTT CATCCGGAAT CTTCTCGCGT GGCCCGTATG
GCTTCCGGCG GGCGGCATGG CCTGGCGCAA TCTATCTTTC AGGTCGGCGG CAACTTTGGC
AGTTCCCTGG GCCCCTTGCT GGCGGCGGTG ATTATCGCGC CTTATGGTAA AGGCAACGTT
GCCTGGTTTG TGCTTGCGGC ACTGCTGGCG ATCGTGGTGT TGGCGCAAAT CAGCCGTTGG
TACTCGGCAC AGCACCGAAT GAATAAAGGA AAACCCAAAG CGACGATTAT CAATCCACTG
CCGCGTAACA AAGTGGTACT GGCGGTCAGC ATTCTGTTAA TCCTCATTTT CTCGAAATAT
TTCTATATGG CGAGCATCAG CAGCTATTAC ACCTTTTATC TGATGCAAAA ATTCGGATTA
TCTATCCAGA ATGCTCAGCT TCATCTGTTT GCCTTCCTGT TTGCCGTTGC GGCAGGTACG
GTGATCGGCG GGCCTGTAGG GGATAAAATT GGGCGGAAAT ATGTGATTTG GGGCTCTATC
CTCGGCGTTG CGCCGTTTAC GCTGATTTTA CCCTACGCCA GCCTGCACTG GACGGGGGTT
TTAACGGTGA TTATTGGATT TATCCTCGCT TCGGCATTCT CTGCCATTCT GGTCTACGCT
CAGGAGCTGC TTCCGGGACG TATCGGTATG GTTTCTGGAC TCTTTTTCGG TTTTGCTTTT
GGCATGGGAG GTCTGGGAGC GGCAGTTCTG GGGCTTATCG CCGATCACAC CAGCATCGAG
TTAGTCTATA AAATCTGTGC TTTCCTGCCA CTATTGGGGA TGTTGACCAT ATTCCTGCCT
GATAACCGGC ATAAAGACTG A
 
Protein sequence
MAMSEQTQPV AGAAASTTKA RTSFGILGAI SLSHLLNDMI QSLILAIYPL LQSEFSLTFM 
QIGMITLTFQ LASSLLQPVV GYWTDKYPMP WSLPIGMCFT LSGLVLLALA GSFGAVLLAA
ALVGTGSSVF HPESSRVARM ASGGRHGLAQ SIFQVGGNFG SSLGPLLAAV IIAPYGKGNV
AWFVLAALLA IVVLAQISRW YSAQHRMNKG KPKATIINPL PRNKVVLAVS ILLILIFSKY
FYMASISSYY TFYLMQKFGL SIQNAQLHLF AFLFAVAAGT VIGGPVGDKI GRKYVIWGSI
LGVAPFTLIL PYASLHWTGV LTVIIGFILA SAFSAILVYA QELLPGRIGM VSGLFFGFAF
GMGGLGAAVL GLIADHTSIE LVYKICAFLP LLGMLTIFLP DNRHKD