Gene EcSMS35_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2189 
SymbolpncB 
ID6143475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2197090 
End bp2198292 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content51% 
IMG OID641617065 
Productnicotinate phosphoribosyltransferase 
Protein accessionYP_001744239 
Protein GI170682315 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1488] Nicotinic acid phosphoribosyltransferase 
TIGRFAM ID[TIGR01514] nicotinate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.084522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.514532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAAT TCGCTTCTCC TGTTCTGCAC TCGTTGCTGG ATACAGATGC TTATAAGTTG 
CATATGCAGC AAGCCGTGTT TCATCACTAT TACGATGTGC ATGTCGCGGC GGAGTTTCGT
TGCCGAGGTG ACGATCTGCT GGGTATTTAT GCCGATGCTA TTCGTGAACA GGTTCAGGCG
ATGCAGCACC TGCGCCTGCA GGATGATGAA TATCAGTGGC TCTCTGCCCT GCCTTTCTTT
CAGGCCGACT ATCTTAACTG GTTACGCGAG TTCCGCTTTA ACCCGGAACA AGTCACCGTA
TCCAACGATA ATGGCAAGCT GGATATTCGT TTAAGCGGCC CGTGGCGTGA AGTCATCCTC
TGGGAAGTTC CTTTGCTGGC GGTTATCAGT GAAATGGTAC ATCGCTATCG CTCACCGCAG
ACCGACGTTG CGCAAGCCCT CGACACGCTG GAAAGCAAAT TAGTCGACTT CTCGGCATTA
ACCGCCGGTC TTGATATGTC GCGCTTCCAT CTGATGGATT TTGGCACCCG CCGCCGTTTT
TCTCGCGAAG TACAAGAAAC CATCGTTAAG CGTCTGCAAC AGGAATCCTG GTTCGTGGGC
ACCAGCAACT ATGATCTGGC GCGCCGGCTT TCCCTCACGC CGATGGGAAC ACAGGCACAC
GAATGGTTCC AGGCGCATCA GCAAATCAGC CCGGATCTAG CCAACAGCCA GCGAGCTGCA
CTTGCTGCCT GGCTGGAAGA GTATCCCGAC CAACTTGGCA TTGCATTAAC CGACTGCATC
ACTATGGATG CTTTCCTGCG TGATTTCGGT GTCGAGTTCG CCAGTCGCTA TCAGGGCCTG
CGTCATGACT CTGGCGACCC GGTTGAATGG GGTGAAAAAG CCATTGCACA TTATGAGAAG
CTGGGAATTG ATCCACAGAG TAAAACGCTG GTTTTCTCTG ACAATCTGGA TTTACGCAAA
GCGGTTGAGC TATACCGCCA TTTCTCTTCC CGCGTGCAAT TAAGTTTTGG TATTGGGACT
CGCCTGACCT GCGATATCCC CCAGGTAAAA CCCCTGAATA TTGTCATTAA GTTGGTTGAG
TGTAACGGTA AACCGGTGGC GAAACTTTCT GACAGCCCTG GCAAAACTAT CTGCCACGAT
AAAGCGTTTG TTCGGGCACT GCGCAAAGCA TTCGACCTTC CGCATATTAA AAAAGCCAGT
TAA
 
Protein sequence
MTQFASPVLH SLLDTDAYKL HMQQAVFHHY YDVHVAAEFR CRGDDLLGIY ADAIREQVQA 
MQHLRLQDDE YQWLSALPFF QADYLNWLRE FRFNPEQVTV SNDNGKLDIR LSGPWREVIL
WEVPLLAVIS EMVHRYRSPQ TDVAQALDTL ESKLVDFSAL TAGLDMSRFH LMDFGTRRRF
SREVQETIVK RLQQESWFVG TSNYDLARRL SLTPMGTQAH EWFQAHQQIS PDLANSQRAA
LAAWLEEYPD QLGIALTDCI TMDAFLRDFG VEFASRYQGL RHDSGDPVEW GEKAIAHYEK
LGIDPQSKTL VFSDNLDLRK AVELYRHFSS RVQLSFGIGT RLTCDIPQVK PLNIVIKLVE
CNGKPVAKLS DSPGKTICHD KAFVRALRKA FDLPHIKKAS