Gene ECH74115_0571 details

Gene Information

Locus tag: ECH74115_0571
Symbol: fsr
ID: 6968736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 572852
End bp: 574072
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 52%
IMG OID: 643384616
Product: fosmidomycin resistance protein
Protein accession: YP_002269130
Protein GI: 209398321
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
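
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(572852, 574072)
print(length, protein_length(length))  # prints "1221 406"
```

Under these assumptions the result matches the reported 1221 bp and 406 aa.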


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.652835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATGA GTGAACAAAC CCAGCCTGTG GCGGGCGCGG CTGCGTCAAC GACCAAGGCC 
CGAACATCGT TTGGTATTTT AGGTGCTATC AGCCTCTCAC ATCTGCTGAA CGACATGATC
CAATCGCTGA TTCTGGCGAT TTATCCTCTG CTTCAGTCAG AATTTTCTCT GACATTTATG
CAGATTGGCA TGATAACCCT CACCTTCCAG CTCGCCTCTT CGCTACTGCA ACCAGTGGTC
GGCTACTGGA CCGATAAATA TCCGATGCCG TGGTCGTTGC CAATTGGCAT GTGCTTTACC
TTAAGTGGTC TGGTGCTGCT TGCGCTGGCG GGCAGTTTTG GCGCAGTTCT GCTGGCGGCG
GCGCTGGTCG GTACCGGTTC ATCGGTCTTT CATCCGGAAT CTTCTCGCGT GGCCCGTATG
GCTTCCGGCG GGCGGCATGG CCTGGCACAA TCTATCTTTC AGGTCGGCGG CAACTTTGGC
AGTTCCCTGG GACCCTTGCT GGCGGCGGTG ATTATCGCGC CTTATGGCAA AGGCAACGTT
GCCTGGTTTG TGCTTGCGGC ACTGCTGGCG ATCGTGGTGT TGGCGCAAAT CAGCCGTTGG
TACTCGGCAC AGCACCGAAT GAATAAAGGA AAACCCAAAG CGACGATTAT CAATCCACTG
CCGCGCAACA AAGTGGTACT GGCGGTCAGC ATTCTGTTAA TCCTCATTTT CTCGAAATAT
TTCTATATGG CGAGCATCAG CAGCTATTAC ACCTTTTATC TGATGCAAAA ATTCGGATTA
TCTATCCAGA ATGCTCAGCT TCATCTGTTT GCCTTCCTGT TTGCCGTTGC GGCAGGTACG
GTGATCGGCG GGCCTGTAGG GGATAAAATT GGGCGGAAAT ATGTGATTTG GGGCTCTATC
CTCGGCGTTG CGCCGTTTAC GCTGATTTTA CCCTACGCCA GCCTGCACTG GACGGGGGTT
TTAACGGTGA TTATTGGATT TATCCTCGCT TCGGCATTCT CTGCCATTCT GGTCTACGCT
CAGGAGCTGC TTCCAGGACG TATCGGTATG GTTTCTGGAC TCTTTTTCGG TTTTGCTTTC
GGCATGGGAG GTCTGGGAGC GGCAGTTCTG GGGCTTATCG CCGATCACAC CAGCATCGAG
TTAGTCTATA AAATATGTGC TTTCCTGCCA CTATTGGGGA TGTTGACCAT ATTCCTGCCT
GATAACCGGC ATAAAGACTG A
 
Protein sequence
MAMSEQTQPV AGAAASTTKA RTSFGILGAI SLSHLLNDMI QSLILAIYPL LQSEFSLTFM 
QIGMITLTFQ LASSLLQPVV GYWTDKYPMP WSLPIGMCFT LSGLVLLALA GSFGAVLLAA
ALVGTGSSVF HPESSRVARM ASGGRHGLAQ SIFQVGGNFG SSLGPLLAAV IIAPYGKGNV
AWFVLAALLA IVVLAQISRW YSAQHRMNKG KPKATIINPL PRNKVVLAVS ILLILIFSKY
FYMASISSYY TFYLMQKFGL SIQNAQLHLF AFLFAVAAGT VIGGPVGDKI GRKYVIWGSI
LGVAPFTLIL PYASLHWTGV LTVIIGFILA SAFSAILVYA QELLPGRIGM VSGLFFGFAF
GMGGLGAAVL GLIADHTSIE LVYKICAFLP LLGMLTIFLP DNRHKD
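
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. A minimal sketch, assuming plain A/C/G/T input, of stripping the block formatting and computing GC content as a percentage. The helper names are hypothetical, and the sample string is the first line of the gene sequence shown above.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above.
first_line = "ATGGCAATGA GTGAACAAAC CCAGCCTGTG GCGGGCGCGG CTGCGTCAAC GACCAAGGCC"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence to `gc_content` in the same way gives a figure that can be compared with the GC content reported in the record.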