Gene ECH74115_4816 details


Gene Information

Locus tag: ECH74115_4816
Symbol: nikA
ID: 6967632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4451349
End bp: 4452923
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 54%
IMG OID: 643388508
Product: nickel ABC transporter, periplasmic nickel-binding protein NikA
Protein accession: YP_002272936
Protein GI: 209399938
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR02294] nickel ABC transporter, periplasmic nickel-binding protein
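
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4451349
end_bp = 4452923

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1575 524
```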


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCTCCA CACTCCGCCG CACTCTATTT GCGCTGCTGG CTTGTGCGTC TTTTATCGTC 
CATGCCGCTG CACCAGATGA AATCACCACC GCCTGGCCGG TGAATGTCGG GCCACTAAAC
CCGCACCTTT ACACGCCTAA CCAGATGTTC GCCCAGAGCA TGGTTTATGA ACCATTGGTG
AAATATCAGG CAGACGGTTC GGTGATCCCG TGGCTGGCAA AAAGCTGGAC TCATTCAGAA
GATGGTAAAA CCTGGACCTT CACCCTGCGT GATGACGTGA AATTCTCTAA CGGCGAACCG
TTTGATGCAG AGGCGGCGGC AGAAAACTTC CGCGCAGTGC TCGATAACCG TCAACGTCAC
GCCTGGCTGG AGCTGGCAAA CCAGATTGTT GATGTTAAAG CACTCAGTAA AACAGAGCTG
CAAATCACCC TGAAAAGCGC CTACTATCCT TTCCTGCAAG AACTGGCCCT GCCGCGCCCT
TTCCGCTTTA TCGCCCCCTC GCAGTTTAAA AACCATGAAA CCATGAACGG GATTAAAGCG
CCGATTGGCA CCGGACCGTG GGTTTTGCAG GAATCGAAAC TGAATCAGTA CGATGTCTTC
GTCCGTAACG AAAACTACTG GGGCGAAAAG CCGGCAATTA AGAAGATCAC TTTTAACGTC
ATCCCAGACC CGACTACCCG CGCGGTGGCG TTTGAAACTG GCGATATCGA CCTGCTGTAC
GGAAACGAAG GGTTATTACC GCTCGATACC TTCGCCCGCT TTAGCCAGAA CCCGGCTTAC
CATACCCAAC TGTCACAGCC GATCGAAACC GTGATGCTGG CGCTTAATAC CGCCAAAGCC
CCCACCAATG AGCTGGCAGT ACGTGAAGCT CTTAATTACG CGGTAAACAA AAAATCGCTG
ATTGATAACG CGTTGTATGG CACCCAGCAG GTCGCCGACA CCCTGTTTGC CCCTTCTGTG
CCCTACGCCA ACCTCGGCCT GAAACCGCGC CAGTACGATC CGCAGAAAGC GAAAGCGTTG
CTGGAAAAAG CCGGTTGGAC GCTGCCTGCG GGCAAAGACA TCCGCGAGAA AAATGGTCAG
CCGCTGCACA TTGAACTTTC GTTCATCGGC ACCGATGCGT TAAGCAAATC GATGGCGGAA
ATCATTCAGG CTGATATGCG CCAGATTGGC GCAGATGTCT CGCTGATTGG CGAAGAAGAG
AGCAGTATCT ATGCGCGTCA GCGCGACGGT CGTTTTGGCA TGATTTTCCA CCGCACCTGG
GGCGCGCCAT ACGATCCACA CGCCTTCCTG AGTTCAATGC GCGTACCGTC GCACGCTGAC
TTCCAGGCAC AGCAAGGATT AGCCGACAAG CCGCTGATTG ATAAAGAGAT CGGCGAAGTG
CTGGCGACCC ATGACGAAGC ACAACGTCAG GCACTGTATC GCGACATTCT GACCCGTCTG
CATGACGAGG CGGTTTATCT GCCTATCAGT TACATCTCAA TGATGGTGGT ATCAAAACCG
GAGCTGGGTA ACATCCCCTA CGCGCCGATC GCCACCGAAA TTCCGTTCGA ACAGATTAAA
CCGGTGAAAC CTTAA
 
Protein sequence
MLSTLRRTLF ALLACASFIV HAAAPDEITT AWPVNVGPLN PHLYTPNQMF AQSMVYEPLV 
KYQADGSVIP WLAKSWTHSE DGKTWTFTLR DDVKFSNGEP FDAEAAAENF RAVLDNRQRH
AWLELANQIV DVKALSKTEL QITLKSAYYP FLQELALPRP FRFIAPSQFK NHETMNGIKA
PIGTGPWVLQ ESKLNQYDVF VRNENYWGEK PAIKKITFNV IPDPTTRAVA FETGDIDLLY
GNEGLLPLDT FARFSQNPAY HTQLSQPIET VMLALNTAKA PTNELAVREA LNYAVNKKSL
IDNALYGTQQ VADTLFAPSV PYANLGLKPR QYDPQKAKAL LEKAGWTLPA GKDIREKNGQ
PLHIELSFIG TDALSKSMAE IIQADMRQIG ADVSLIGEEE SSIYARQRDG RFGMIFHRTW
GAPYDPHAFL SSMRVPSHAD FQAQQGLADK PLIDKEIGEV LATHDEAQRQ ALYRDILTRL
HDEAVYLPIS YISMMVVSKP ELGNIPYAPI ATEIPFEQIK PVKP
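
The relationship between the two sequence blocks can be sketched as follows: strip the 10-character grouping, then translate codon by codon until a stop codon. This is a minimal sketch, not the database's own procedure. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (to my knowledge the two differ only in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = clean(
    "ATGCTCTCCA CACTCCGCCG CACTCTATTT GCGCTGCTGG CTTGTGCGTC TTTTATCGTC"
)
last_line = clean("CCGGTGAAAC CTTAA")

print(translate(first_line))  # prints: MLSTLRRTLFALLACASFIV
print(translate(last_line))   # prints: PVKP
```

Both outputs agree with the start and end of the protein sequence above. Applying `gc_percent` to the full cleaned gene sequence would give a figure to compare against the listed GC content.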