Gene Moth_1836 details

Gene Information

Locus tag: Moth_1836
Symbol:
ID: 3832806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1893633
End bp: 1894673
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 63%
IMG OID: 637829767
Product: nicotinate phosphoribosyltransferase
Protein accession: YP_430679
Protein GI: 83590670
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1488] Nicotinic acid phosphoribosyltransferase
TIGRFAM ID:
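
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are inclusive at both ends and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable and function names are hypothetical, not part of any database API.

```python
# Values copied from the gene record above.
start_bp = 1893633
end_bp = 1894673
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons = span // 3


def record_is_consistent():
    """True when the span matches the gene length and codons - 1 (stop) matches the protein length."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons, record_is_consistent())  # prints: 1041 347 True
```

Under these assumptions the 1041 bp span gives 347 codons, that is, 346 residues plus one stop codon, which agrees with the listed protein length.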


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0347625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGACCG AGGTCATCAC TTCCCTGGAG CAGGTACAAC AATTAGAGGT CAAACCGGAC 
CGGCGGTTCT ATTCGGCCGA GCACGGGGAG ATTGCCAGCG GGGCGACTAC GGATATTTAT
TTTGTCCGCA CCTATGAGAT TCTTAAAAGC CTGGGCAAGG TCGACACGGT AGTTACGGCC
GAGATCTTTC CCCGCCGGGC CGGGATCCTC TGCGGGGTCA ACGAGGTCCT GGAGCTTTTG
CGGGACAAAA AGGTGACCGT TTACGGCCTG CCGGAGGGGA GCCCCTTTGA GCCGAAAGAG
GTGGTCATGC GCATCCAGGG TCCCTATAGC GAGTTTGGCC TCTTTGAAAC TACCTTGCTG
GGAATGCTGG CCAGCTCCAG CGGCTGGGCT ACGGCGGCCC GGGAAATCAG GGAAGCGGCT
GGTGAACATC CCTTTGTCTG CTTCGGGGCG CGCCACGTTC ACCCGGCGGT GGCGCCGGTC
ATGGAGCGGG CGGCCATTGT CGGCGGCGCC GACGGGGCGA GTTGCATCCT GGCGGCCAAA
CTGGCCGGCC GGGAGCCCCA GGGAACGGTA CCCCATGCGG TATTCCTGAT CATCGGCGAT
ACAGTCGAGG GGGCCCTGGC CTACGACCGC CTCATGCCCC CTGACGCCAA GCGGACCATC
CTGATCGACA CCTTTAAAGA TGAGGCTGAA GAGGCCCTGC GGGTAGCCAG TGCCCTGGGG
CCGGCCCTGG CCGGGGTACG TTTGGATACC CCCAGCGAGC GAGGCGGCGT CACCCCGGAA
CTGGTCCGGG AAGTGCGCTA TCGCCTGGAT ATGGCCGGCT TTAACCATGT GGGGATTTTT
GTCTCCGGAG GCCTGACGCC GGAACGTATC CGGACCCTTA TCGAAGCCGG GGCCGACGCC
TTCGGCGTGG GCAGCTATAT TTCCGGCGCG GCCCCCATTG ATATGACCAT GGACTTAAAG
GAGGTCGACG GCCGCCCGGT GGCCAAACGC GGCCGCCTGC CGGGGATCAT TCCCAATCCC
CGGCTGGTGC AGTTGAAATA G
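
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. The following is a minimal sketch of that clean-up plus a GC-percentage calculation that could be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text):
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, exactly as printed.
first_line = "ATGGGGACCG AGGTCATCAC TTCCCTGGAG CAGGTACAAC AATTAGAGGT CAAACCGGAC"
seq = clean_sequence(first_line)

print(len(seq))                   # number of bases on that line after clean-up
print(round(gc_percent(seq), 1))  # GC percentage of that line only
```

To check the whole gene, paste all sequence lines into one string and pass it through `clean_sequence` first. The cleaned length should then equal the gene length given in the record.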
 
Protein sequence
MGTEVITSLE QVQQLEVKPD RRFYSAEHGE IASGATTDIY FVRTYEILKS LGKVDTVVTA 
EIFPRRAGIL CGVNEVLELL RDKKVTVYGL PEGSPFEPKE VVMRIQGPYS EFGLFETTLL
GMLASSSGWA TAAREIREAA GEHPFVCFGA RHVHPAVAPV MERAAIVGGA DGASCILAAK
LAGREPQGTV PHAVFLIIGD TVEGALAYDR LMPPDAKRTI LIDTFKDEAE EALRVASALG
PALAGVRLDT PSERGGVTPE LVREVRYRLD MAGFNHVGIF VSGGLTPERI RTLIEAGADA
FGVGSYISGA APIDMTMDLK EVDGRPVAKR GRLPGIIPNP RLVQLK
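
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It assumes the codon-to-amino-acid assignments of table 11 are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGGGACCG AGGTCATCAC TTCCCTGGAG"))  # prints: MGTEVITSLE
print(translate("CGGCTGGTGC AGTTGAAATA G"))           # prints: RLVQLK
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Translating the full gene sequence the same way allows a residue-by-residue comparison.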