Gene EcSMS35_3760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3760 
SymbolnikA 
ID6144357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3825415 
End bp3826989 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content54% 
IMG OID641618586 
Productnickel ABC transporter, periplasmic nickel-binding protein NikA 
Protein accessionYP_001745726 
Protein GI170684038 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR02294] nickel ABC transporter, periplasmic nickel-binding protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCTA CACTCCGCCG CACTCTATTT GCGCTGCTGG CTTGTGCGTC TTTTATCGTC 
CATGCCGCTG CACCAGATGA AATCACCACC GCCTGGCCGG TGAATGTCGG GCCACTAAAC
CCGCACCTTT ACACGCCTAA CCAAATGTTC GCCCAGAGCA TGGTTTATGA ACCACTGGTG
AAATATCAGG CAGACGGTTC GGTGATCCCG TGGCTGGCAA AAAGCTGGAC CCATTCAGAA
GATGGCAAAA CCTGGACCTT CACCCTGCGT GATGATGTGA AATTCTCTAA CGGTGAACCG
TTTGATGCAG AGGCCGCGGC AGAAAACTTC CGCGCTGTGC TCGACAACCG TCAACGTCAC
GCCTGGCTGG AGCTGGCAAA CCAGATTGTT GATGTCAAAG CACTCAGCAA AAACGAGCTG
CAAATCACCC TGAAAAGCGC CTACTATCCT TTCCTGCAAG AACTGGCCCT GCCGCGCCCT
TTCCGCTTTA TCGCCCCCTC GCAGTTTAAA AACCATGAAA CCATGAATGG GATTAAAGCG
CCGATTGGCA CCGGGCCGTG GGTTTTGCAG GAATCGAAAC TGAATCAGTA CGATGTCTTC
GTCCGTAACG AAAACTACTG GGGCGAAAAG CCGGCGATTA AGAAGATCAC TTTTAACGTC
ATCCCAGACC CGACTACCCG CGCAGTAGCG TTTGAAACCG GCGATATCGA TCTACTATAC
GGTAATGAAG GGCTATTACC GCTCGATACC TTCGCCCGCT TTAGCCAGAA CTCGGCTTAC
CACACCCAAC TGTCACAGCC GATCGAAACC GTGATGCTGG CGCTCAATAC TGCCAAAGCC
CCCACCAACG AGCTGGCCGT ACGTGAAGCT CTCAATTACG CGGTAAACAA AAAATCGCTG
ATCGATAACG CGTTGTATGG CACCCAGCAG GTCGCCGACA CCCTGTTTGC CCCTTCCGTG
CCCTATGCCA ATCTCGGCCT GAAACCGCGC CAGTACGATC CGCAGAAAGC GAAAGCGTTG
CTGGAACAAG CAGGCTGGAC GCTGCCCGCA GGCAAAGACA TCCGCGAGAA AAATGGTCAG
CCGCTGCGCA TTGAACTGTC GTTCATCGGC ACCGATGCGT TAAGCAAATC AATGGCCGAA
ATCATTCAGG CTGATATGCG CCAGATTGGC GCGGATGTCA CGCTGATCGG CGAAGAAGAG
AGCAGTATCT ATGCTCGTCA GCGCGACGGT CGTTTTGGCA TGATTTTCCA CCGTACCTGG
GGCGCGCCAT ACGATCCACA CGCCTTCCTG AGTTCAATGC GCGTACCGTC GCACGCTGAC
TTCCAGGCAC AGCAAGGTTT AGCCGACAAG CCACTGATTG ATAAAGAGAT TGGCGAAGTG
CTGGCGACAC ACGACGAAAC ACAACGTCAG GCACTGTATC GCGACATTCT GACCCGTCTG
CATAACGAGG CGGTTTATCT GCCCATCAGC TATATCTCAA TGATGGTGGT TTCAAAACCG
GAACTGGGTA ACATCCCCTA CGCGCCGATC GCCACCGAAA TCCCGTTCGA ACAGATTAAA
CCGGTGAAAC CTTAA
 
Protein sequence
MFSTLRRTLF ALLACASFIV HAAAPDEITT AWPVNVGPLN PHLYTPNQMF AQSMVYEPLV 
KYQADGSVIP WLAKSWTHSE DGKTWTFTLR DDVKFSNGEP FDAEAAAENF RAVLDNRQRH
AWLELANQIV DVKALSKNEL QITLKSAYYP FLQELALPRP FRFIAPSQFK NHETMNGIKA
PIGTGPWVLQ ESKLNQYDVF VRNENYWGEK PAIKKITFNV IPDPTTRAVA FETGDIDLLY
GNEGLLPLDT FARFSQNSAY HTQLSQPIET VMLALNTAKA PTNELAVREA LNYAVNKKSL
IDNALYGTQQ VADTLFAPSV PYANLGLKPR QYDPQKAKAL LEQAGWTLPA GKDIREKNGQ
PLRIELSFIG TDALSKSMAE IIQADMRQIG ADVTLIGEEE SSIYARQRDG RFGMIFHRTW
GAPYDPHAFL SSMRVPSHAD FQAQQGLADK PLIDKEIGEV LATHDETQRQ ALYRDILTRL
HNEAVYLPIS YISMMVVSKP ELGNIPYAPI ATEIPFEQIK PVKP