Gene EcHS_A1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1039 
SymbolpncB 
ID5592248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1050770 
End bp1051972 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content51% 
IMG OID640920206 
Productnicotinate phosphoribosyltransferase 
Protein accessionYP_001457771 
Protein GI157160453 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1488] Nicotinic acid phosphoribosyltransferase 
TIGRFAM ID[TIGR01514] nicotinate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.00414982 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAAT TCGCTTCTCC TGTTCTGCAC TCGTTGCTGG ATACAGATGC TTATAAGTTG 
CATATGCAGC AAGCCGTGTT TCATCATTAT TACGATGTGC ATGTCGCGGC GGAGTTTCGT
TGCCGAGGTG ACGATCTGCT GGGTATTTAT GCCGATGCTA TTCGTGAACA GATTCAGGCG
ATGCAGCACC TGCGCCTGCA GGATGATGAA TATCAGTGGC TTTCTGCCCT GCCTTTCTTT
AAGGCCGACT ATCTTAACTG GTTACGCGAG TTCCGCTTTA ACCCGGAACA AGTCACCGTG
TCCAACGATA ATGGCAAGCT GGATATTCGT TTAAGCGGCC CGTGGCGTGA AGTCATCCTC
TGGGAAGTTC CTTTGCTGGC GGTTATCAGT GAAATGGTAC ATCGCTATCG CTCACCGCAG
GCCGACGTTG CGCAAGCCCT CGACACGCTG GAAAGCAAAT TAGTCGACTT CTCGGCGTTA
ACCGCCGGTC TTGATATGTC GCGCTTCCAT CTGATGGATT TTGGCACCCG TCGCCGTTTT
TCTCGCGAAG TACAAGAAAC CATCGTTAAG CGTCTGCAAC AGGAATCCTG GTTTGTGGGC
ACCAGCAACT ACGATCTGGC GCGTCGGCTT TCCCTCACGC CGATGGGAAC ACAGGCACAC
GAATGGTTCC AGGCACATCA GCAAATCAGC CCGGATCTAG CCAACAGCCA GCGAGCTGCA
CTTGCTGCCT GGCTGGAAGA GTATCCCGAC CAACTTGGCA TTGCATTAAC CGACTGCATC
ACTATGGATG CTTTCCTGCG TGATTTCGGT GTCGAGTTCG CTAGTCGGTA TCAAGGCCTG
CGTCATGACT CTGGCGACCC GGTTGAATGG GGTGAAAAAG CCATTGCACA TTATGAAAAG
CTGGGAATTG ATCCACAGAG TAAAACGCTG GTTTTCTCTG ACAATCTGGA TTTACGCAAA
GCGGTTGAGC TATACCGCCA CTTCTCTTCC CGCGTGCAAT TAAGTTTTGG TATTGGGACT
CGCCTGACCT GCGATATCCC CCAGGTAAAA CCCCTGAATA TTGTGATTAA GTTGGTAGAG
TGTAACGGTA AACCGGTGGC GAAACTTTCT GACAGCCCTG GCAAAACTAT CTGCCACGAT
AAAGCGTTTG TTCGGGCGCT GCGCAAAGCG TTCGACCTTC CGCATATTAA AAAAGCCAGT
TAA
 
Protein sequence
MTQFASPVLH SLLDTDAYKL HMQQAVFHHY YDVHVAAEFR CRGDDLLGIY ADAIREQIQA 
MQHLRLQDDE YQWLSALPFF KADYLNWLRE FRFNPEQVTV SNDNGKLDIR LSGPWREVIL
WEVPLLAVIS EMVHRYRSPQ ADVAQALDTL ESKLVDFSAL TAGLDMSRFH LMDFGTRRRF
SREVQETIVK RLQQESWFVG TSNYDLARRL SLTPMGTQAH EWFQAHQQIS PDLANSQRAA
LAAWLEEYPD QLGIALTDCI TMDAFLRDFG VEFASRYQGL RHDSGDPVEW GEKAIAHYEK
LGIDPQSKTL VFSDNLDLRK AVELYRHFSS RVQLSFGIGT RLTCDIPQVK PLNIVIKLVE
CNGKPVAKLS DSPGKTICHD KAFVRALRKA FDLPHIKKAS