Gene EcSMS35_1133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1133 
SymbolcobT 
ID6145478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1152500 
End bp1153579 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content52% 
IMG OID641616011 
Productnicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase 
Protein accessionYP_001743203 
Protein GI170682841 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2038] NaMN:DMB phosphoribosyltransferase 
TIGRFAM ID[TIGR03160] nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000016088 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000108099 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAACAC TTGCCGATTT ACTGAATACG ATCCCTGCCA TCGATCCTGC CGCGATGTCG 
CGTGCACAAC GGCATATTGA CGGGTTACTC AAACCTGTTG GTAGCCTGGG AAGGCTGGAG
GCGCTTGCCA TACAACTGGC AGGAATGCCG GGGTTGAATG GCATACCGCA TGTGGGCAAA
AAAGCGGTAC TGGTTATGTG TGCCGATCAC GGCGTCTGGG AGGAAGGGGT CGCTATTTCC
CCAAAAGAAG TGACAGCCAT TCAGGCTGAA AATATGACCC GCGGAACAAC CGGCGTGTGT
GTGCTGGCAG CACAAGCGGG CGCTAACGTC TACGTAATTG ATGTTGGTAT TGATACTGCT
GAGCCTATCC CCGGGCTTAT CAACATGCGT GTTGCTCGAG GTAGTGGCAA TATTGCTTCA
GCTCCGGCAA TGAGTCGCCA TCAGGCAGAA AAGTTGCTTT TGGACGTCAT ATGTTATACG
CGGGAGCTGG CAAAAAACGG TGTTACGCTG TTTGGTGTAG GTGAACTGGG GATGGCAAAC
ACGACACCGG CAGCGGCAAT AGTCAGCACA ATCACTGGCC GGGCTCCTGA AGAAGTGGTT
GGGATTGGCG CAAACCTGCC GACAGATAAA CTGGCTAATA AAATTGATGT TGTGCGCCGG
GCGATTACGT TGAATCAACC AAATCCGCAG GATGGCGTTG ATGTCCTGGC AAAAGTGGGT
GGATTTGATT TGGTCGGAAT GGCTGGCGTG ATGTTAGGCG CTGCTTCCTG CGGTTTACCC
GTGTTGCTGG ATGGATTTCT TTCTTATGCT GCCGCGCTCG CAGCCTGCCA GATGTCTCCT
GCAATCAAAC CGTATCTCAT TCCTTCTCAC CTGTCGGCAG AAAAAGGGGC GCGTATTGCG
CTGTCGCATT TGGGGCTGGA GCCTTTTCTC AATATGGAGA TGCGTTTAGG TGAGGGGAGT
GGCGCAGCTC TGGCGATGCC CATCATCGAA GCTGCCTGTG CGATATATAA CAACATGGGC
GAACTTGCTG CCAGTAATAT TGTTCTACCG GGGAATACGA CTTCTGATTT GAACAGTTAA
 
Protein sequence
MQTLADLLNT IPAIDPAAMS RAQRHIDGLL KPVGSLGRLE ALAIQLAGMP GLNGIPHVGK 
KAVLVMCADH GVWEEGVAIS PKEVTAIQAE NMTRGTTGVC VLAAQAGANV YVIDVGIDTA
EPIPGLINMR VARGSGNIAS APAMSRHQAE KLLLDVICYT RELAKNGVTL FGVGELGMAN
TTPAAAIVST ITGRAPEEVV GIGANLPTDK LANKIDVVRR AITLNQPNPQ DGVDVLAKVG
GFDLVGMAGV MLGAASCGLP VLLDGFLSYA AALAACQMSP AIKPYLIPSH LSAEKGARIA
LSHLGLEPFL NMEMRLGEGS GAALAMPIIE AACAIYNNMG ELAASNIVLP GNTTSDLNS