Gene EcHS_A2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2114 
SymbolcobT 
ID5594700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2101990 
End bp2103069 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content52% 
IMG OID640921253 
Productnicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase 
Protein accessionYP_001458792 
Protein GI157161474 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2038] NaMN:DMB phosphoribosyltransferase 
TIGRFAM ID[TIGR03160] nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000168624 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAC TTGCCGATTT ACTGAATACG ATCCCTGCTA TCGATCCTGC CGCTATGTCG 
CGTGCACAAC GGCATATTGA CGGGTTACTC AAACCTGTTG GTAGCCTGGG AAGGCTGGAG
GCGCTTGCCA TACAACTGGC GGGAATGCCG GGGTTGAATG GCATACCGCA TGTGGGCAAA
AAAGCGGTAC TGGTTATGTG TGCCGATCAC GGCGTCTGGG AGGAAGGGGT CGCTATTTCC
CCAAAAGAAG TGACAGCCAT TCAGGCTGAA AATATGACCC GCGGAACAAC CGGCGTGTGT
GTGCTGGCAG CACAAGCGGG CGCTAACGTC CACGTAGTTG ATGTTGGTAT TGATAGTGCT
GAGCCTATCC CCGGGCTTAT CAACATGCGT GTCGCACGAG GTAGCGGCAA TATTGCTTCA
GCTCCGGCAA TGAGTCGCCG TCAGGCTGAA AAGTTGCTTT TGGACGTCAT ATGTTATACG
CGGGAGCTGG CAAAAAACGG TGTCACGCTG TTTGGTGTAG GTGAACTGGG GATGGCAAAC
ACGACACCGG CAGCGGCAAT AGTCAGCACA ATCACTGGCC GGGATCCTGA AGAAGTGGTT
GGGATTGGCG CAAACCTGCC GACAGATAAA CTGGCTAATA AAATTGATGT TGTGCGTCGG
GCGATTACGT TGAATCAACC AAATCCTCAG GATGGTATTG ATGTCCTGGC AAAAGTGGGT
GGATTTGATT TGGTCGGAAT AGCTGGAGTG ATGTTAGGTG CTGCTTCCTG CGGTTTACCC
GTGTTGCTGG ATGGATTTCT TTCTTATGCT GCTGCGCTCG CAGCCTGCCA GATGTCTCCT
GCAATCAAAC CGTATCTCAT TCCTTCTCAC TTGTCGGTAG AAAAAGGCGC GCGTATAGCG
CTCTCGCATT TGGGGCTGGA GCCTTATCTC AATATGGATA TGCGTTTAGG TGAGGGGAGT
GGTGCAGCTC TGGCGATGCC CATCATCGAA GCTGCTTGTG CGATATACAA CAACATGGGC
GAACTTGCTG CCAGTAATAT TGTTCTACCG GGGAATACGA CTTCTGATTT GAACAGCTAA
 
Protein sequence
MQTLADLLNT IPAIDPAAMS RAQRHIDGLL KPVGSLGRLE ALAIQLAGMP GLNGIPHVGK 
KAVLVMCADH GVWEEGVAIS PKEVTAIQAE NMTRGTTGVC VLAAQAGANV HVVDVGIDSA
EPIPGLINMR VARGSGNIAS APAMSRRQAE KLLLDVICYT RELAKNGVTL FGVGELGMAN
TTPAAAIVST ITGRDPEEVV GIGANLPTDK LANKIDVVRR AITLNQPNPQ DGIDVLAKVG
GFDLVGIAGV MLGAASCGLP VLLDGFLSYA AALAACQMSP AIKPYLIPSH LSVEKGARIA
LSHLGLEPYL NMDMRLGEGS GAALAMPIIE AACAIYNNMG ELAASNIVLP GNTTSDLNS