Gene Moth_0915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0915 
Symbol 
ID3831304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp950452 
End bp951582 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content65% 
IMG OID637828846 
Product5-amino-6-(5-phosphoribosylamino)uracil reductase / diaminohydroxyphosphoribosylaminopyrimidine deaminase 
Protein accessionYP_429775 
Protein GI83589766 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000729701 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCAC AGGATGCCAT CTTCATGGCC CGGGCCCTGG AACTGGCCCG CCAGGGACTG 
GGCCGTACCA GCCCCAACCC GACAGTAGGA GCAGTTATCG TTCGCGACGG CCAGGTGGTA
GGTGAGGGTT ATCACCAGAA GGCTGGCACC CCCCACGCTG AGATCCATGC CCTGCGTGCC
GCCGGGGAGA AGGCCCGGGG AGCCACCCTG TATGTAACCC TGGAACCCTG CTGCCACTAC
GGCCGGACGC CCCCGTGCAC CGAGGCCATC ATCGCCGCCG GGATCAAGAG GGTGGTAGCA
GCCATGGCCG ATCCCAATCC CCGGGTAGCG GGAGGCGGCT TCCGGGCCCT GAGCCAGGCC
GGGATAGAGG TAGAGACGGG ACTGCTGGCA GATGAAGCCC GGAGATTAAA TGAGGCCTTT
ATTAAGTATA TTACTACCGG CAGGCCCTGG GTGACCCTGA AGATGGCCCT GACCCTGGAC
GGGAAGATCG CCACCCGTAC CGGGGCCGCC CGCTGGATTA CCGGCCCGGC TGCCAGGCAG
AGGGCCCATG AGTTGCGGGA TATCCATGAT GCCATTCTAG TGGGCATTGG CACCGTCCTG
GCCGACGATC CCGAATTAAC CACCCGCCTG CCGGACGGCC GGGGGCGGGA CGCTATCAGG
GTAATCCTGG ACAGCCACCT GCGGCTGCCC CTGAGTGCCA GGGTGGTAAA CCTCCAATCG
GAAGCCCCTA CTCTGGTGGT CACCACACCT TCGGCCCCGG CAGCAGCCAG GGAAAACCTT
GCGGCCCGGG GAGTGGAGGT TCTGGTCCTG CCGGAGGAAG ACGGCCGGGT GGCCTGGCAA
CCGCTCCTGG CCGAGCTAGC CAGGCGCCAG GTCACCAGTA TCCTGGTCGA GGGAGGAGCC
GAGGTAAACG CCACCGCCCT GGCGGCCGGG ATTGTCGACA AGGTGGTGGC CTTTATCGCC
CCTAAAATCT TCGGGGGCAG AGAGGCACCG GCACCGGTGG GCGGCCTGGG GGTCGCCGAC
CCGGCGACCG CCTGGAAATT AGAAAAACTG GCCGTGGAGC GCTGCGGCGA AGATATCATG
TTGAGCGGGT ACCTGCTCAA AAGAGGGGAA GAGCCTTGTT TACCGGGTTA A
 
Protein sequence
MQPQDAIFMA RALELARQGL GRTSPNPTVG AVIVRDGQVV GEGYHQKAGT PHAEIHALRA 
AGEKARGATL YVTLEPCCHY GRTPPCTEAI IAAGIKRVVA AMADPNPRVA GGGFRALSQA
GIEVETGLLA DEARRLNEAF IKYITTGRPW VTLKMALTLD GKIATRTGAA RWITGPAARQ
RAHELRDIHD AILVGIGTVL ADDPELTTRL PDGRGRDAIR VILDSHLRLP LSARVVNLQS
EAPTLVVTTP SAPAAARENL AARGVEVLVL PEEDGRVAWQ PLLAELARRQ VTSILVEGGA
EVNATALAAG IVDKVVAFIA PKIFGGREAP APVGGLGVAD PATAWKLEKL AVERCGEDIM
LSGYLLKRGE EPCLPG