Gene Moth_1721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1721 
Symbol 
ID3833021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1764781 
End bp1765869 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content62% 
IMG OID637829646 
Productnicotinate-nucleotide-dimethylbenzimidazole phosphoribosyltransferase 
Protein accessionYP_430566 
Protein GI83590557 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2038] NaMN:DMB phosphoribosyltransferase 
TIGRFAM ID[TIGR03160] nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.588177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.361889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGGAT TAAAGCTGGA GGAGACGGTT AAAGGGATCA TGCCTGTCAA CGATGCCTGG 
CGCCGGAAGG CCCGTGAGCA CTTAAACAAC CTGGCCATTC CGGTAGGAAG CCTGGGAAGG
CTGCTGGACA TCGCCGAACA ACTGGCCGCC ATAAAGGAAA GCCTGAAGCC TTCGACGGGT
AGCAAGGTGG TCGTCACCAT GGCCGGGGAC CACGGCGTTG TTGAGGAAGG GGTCAGCACC
TGTCCCCAGA GAGTGACCCT CCAGATGGTT TACAATTTTG TAGCCGGCGG GGCCGGGATC
AATGCCCTGG CCGGGGCGGC CGGGGCCAGG GTAGTGGTTG TGGATATGGG CGTGGCCGGA
GATCTGAAGG ACCTGGTGGA GCAGGGGAAG ATCCTTTCCC GCAAGGTGGA TTACGGAACG
CGCAATATGA CCAGGGGCCC TGCCATGACC AGGCAACAGG CGGTGCAGGC CCTGGAGACC
GGCATCAACA TCGCCGGAGA CCTGGTCAAT GAAGGCGTTG AACTGCTGGG AACAGGGGAT
ATGGGGATCG GCAACACCAC CCCGAGCAGC GCCATTCTGG CGGCCCTTTC CGGCCTGCCG
GTCCGGGAGG TGACGGGGAG GGGCACCGGG ATCGACGACG AGACCCTGGC AAGGAAGGTC
CAGGTGATCG AGAGGGCCCT TGCCCTGAAC AGGCCGGACC CGGGTGACCC AGTAGACGTT
CTGGCCAAGG TGGGCGGTTT CGAGATCGGG GGAATTGCGG GGTTGATTCT CGGGGCGGCC
TACTACCGGG TGCCAGTTGT GGTGGACGGA TTTATATCCA CCGCCGGTGC CCTCCTGGCG
AAACAACTCG CCCCCCGGGC GGTTGATTAC ATGATCGCCG CCCACCGGTC CATGGAGTAC
GGGCACAGGT ATATGCTCAA AGAGCTCGGC CTGCGGCCGC TGCTCGATTT AGACATGCGC
CTGGGAGAGG GTACAGGGGC TGCCCTGGCC ATGTGCATTG TAGAAGGGGC GGCGCGGGTG
ATCGGCGAGA TGCTCACCTT TGAAGATGCC GGGGTCGCCA GAAATAAGTC CAGGGAGTAT
GCGGTATGA
 
Protein sequence
MMGLKLEETV KGIMPVNDAW RRKAREHLNN LAIPVGSLGR LLDIAEQLAA IKESLKPSTG 
SKVVVTMAGD HGVVEEGVST CPQRVTLQMV YNFVAGGAGI NALAGAAGAR VVVVDMGVAG
DLKDLVEQGK ILSRKVDYGT RNMTRGPAMT RQQAVQALET GINIAGDLVN EGVELLGTGD
MGIGNTTPSS AILAALSGLP VREVTGRGTG IDDETLARKV QVIERALALN RPDPGDPVDV
LAKVGGFEIG GIAGLILGAA YYRVPVVVDG FISTAGALLA KQLAPRAVDY MIAAHRSMEY
GHRYMLKELG LRPLLDLDMR LGEGTGAALA MCIVEGAARV IGEMLTFEDA GVARNKSREY
AV