Gene EcHS_A0556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0556 
Symbolfsr 
ID5591388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp569541 
End bp570761 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content52% 
IMG OID640919740 
Productfosmidomycin resistance protein 
Protein accessionYP_001457324 
Protein GI157160006 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATGA GTGAACAACC CCAGCCTGTG GCGGGCGCGG CTGCGTCAAC GACCAAGGCC 
CGAACATCGT TTGGTATTTT AGGTGCTATC AGCCTCTCAC ATCTGCTGAA CGACATGATC
CAATCGCTGA TTCTGGCGAT TTATCCGCTG CTTCAGTCAG AATTTTCTCT GACATTTATG
CAGATTGGCA TGATAACCCT CACCTTCCAG CTCGCCTCTT CGCTACTGCA ACCAGTGGTC
GGCTACTGGA CCGATAAATA TCCGATGCCG TGGTCGTTGC CAATTGGCAT GTGCTTTACC
TTAAGTGGTC TGGTGCTGCT TGCGCTGGCG GGCAGTTTTG GCGCAGTTCT GCTGGCGGCG
GCGCTGGTCG GTACCGGTTC ATCGGTCTTT CATCCGGAAT CTTCTCGCGT GGCCCGTATG
GCTTCCGGCG GGCGGCATGG CCTGGCACAA TCTATCTTTC AGGTCGGCGG CAACTTTGGC
AGTTCCCTGG GACCCTTGCT GGCGGCGGTG ATTATCGCGC CTTATGGCAA AGGCAACGTT
GCCTGGTTTG TGCTTGCGGC ACTGCTGGCG ATCGTGGTGT TGGCACAAAT CAGCCGTTGG
TACTCGGCAC AGCACCGAAT GAATAAAGGA AAACCCAAAG CGACGATAAT CAATCCACTG
CCGCGCAACA AAGTGGTACT GGCAGTCAGC ATTCTGTTAA TCCTCATTTT CTCGAAATAT
TTCTATATGG CGAGCATCAG CAGCTATTAC ACCTTTTATC TGATGCAAAA ATTCGGATTA
TCTATCCAGA ATGCCCAGCT TCATCTGTTT GCCTTCCTGT TTGCCGTTGC GGCAGGTACG
GTGATCGGCG GGCCTGTAGG GGATAAAATT GGACGGAAAT ATGTGATTTG GGGCTCTATC
CTCGGCGTTG CGCCGTTTAC GCTGATTTTA CCCTACGCCA GCCTGCACTG GACGGGGGTT
TTAACGGTGA TTATTGGATT TATCCTCGCT TCGGCATTCT CTGCCATTCT GGTCTACGCT
CAGGAGCTAC TTCCGGGACG TATCGGTATG GTTTCTGGAC TCTTTTTCGG TTTTGCTTTT
GGCATGGGAG GTCTGGGAGC GGCAGTTCTG GGGCTTATCG CCGATCACAC CAGCATCGAG
TTAGTCTATA AAATCTGTGC TTTCCTGCCA CTATTGGGGA TGTTGACCAT ATTCCTGCCT
GATAACCGGC ATAAAGACTG A
 
Protein sequence
MAMSEQPQPV AGAAASTTKA RTSFGILGAI SLSHLLNDMI QSLILAIYPL LQSEFSLTFM 
QIGMITLTFQ LASSLLQPVV GYWTDKYPMP WSLPIGMCFT LSGLVLLALA GSFGAVLLAA
ALVGTGSSVF HPESSRVARM ASGGRHGLAQ SIFQVGGNFG SSLGPLLAAV IIAPYGKGNV
AWFVLAALLA IVVLAQISRW YSAQHRMNKG KPKATIINPL PRNKVVLAVS ILLILIFSKY
FYMASISSYY TFYLMQKFGL SIQNAQLHLF AFLFAVAAGT VIGGPVGDKI GRKYVIWGSI
LGVAPFTLIL PYASLHWTGV LTVIIGFILA SAFSAILVYA QELLPGRIGM VSGLFFGFAF
GMGGLGAAVL GLIADHTSIE LVYKICAFLP LLGMLTIFLP DNRHKD