Gene Moth_0757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0757 
Symbol 
ID3831470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp793490 
End bp794554 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content52% 
IMG OID637828688 
Productnucleotidyl transferase 
Protein accessionYP_429618 
Protein GI83589609 
COG category[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.550626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTGGC ACGATGATAA ACTCGGGGAA ATCTGGGCCC TGCCCGGGGA GAGCCTGAAG 
CTGGCTCTGC CGCGGATGGA CGGGGCCGGC CTGCAGGTAC TTTTAGTAGG AGATACAGAA
AGGCATCTCC TCGGCATCAT CACTGACGGC GACATTCGCC GGGCGCTGCT GCGGGGCGAA
AGTCTGGACG TCCCCGTCGG GCAGGTCATG CAGGCGCGGC CCAAGGTTCT GCCGGCCGGT
GTGTCTTTAG ATGCGGCGAG AAGGTTGATG TTGACCCATA ATATTCGTCA TATCCCTTTG
GTAAATAACG AACACCAGGT AGTGGACCTG CTCCTATGGA TAGATCTATT TGGGAGTAAG
GTGGAGGCGA GGCCCGAACC GGTGGTGATT ATGGCCGGCG GCAAGGGTAC CCGGCTGGAT
CCTTTTACCA AGATTTTACC AAAACCGATG CTCCCCCTTG GCGATAAACC AATAGTAGAA
GTTTTAATGG ATAGATTTTA TGACCAGGGA TTTTCGCAGT TTATTCTTTC AGTGGGCTAT
AAAGCTGAAG TAGTAAAGTT ATATTTTAAC GATAGCAACG GCCGGCCGTA CAAAGTAAAT
TTTGTCCAGG AAGAAGAACC CCTGGGGACT GCCGGTGCCC TGGGGCTCCT GCGGCAGCAG
CTTCAGGGAA CCTTCCTGGT GACCAACTGC GACGTTATTA TTGAGATGAA TTATGGGGAA
TTGCTGCGTT ACCATCATGA GAAAGGGAAT GCCCTCACTA TAGTGGGAGC CCTGCGGGAT
TTTACCATTC CATATGGCGT TTTGCGTACC GAGGCAGGGG AATTTCACCA GATAGAAGAG
AAGCCTAGTT TCCATTTTCT GGTCAACATC GGCCTGTACG TGCTGGAACC CGAAGTTTTA
GAGGGCCTTG ATAATAGTTC TTTCATACAT ATGACGGATT TAATTATGGC CACCAAAGAT
AAAGGCCTGC GGGTGGGGGT ATACCCCCAC CATGGTCGCT GGTTCGACAT CGGCCAGTGG
GACGAGTACC GCCAGACCCT GCGGGCCTTT GAAGGCCTGA TATAA
 
Protein sequence
MGWHDDKLGE IWALPGESLK LALPRMDGAG LQVLLVGDTE RHLLGIITDG DIRRALLRGE 
SLDVPVGQVM QARPKVLPAG VSLDAARRLM LTHNIRHIPL VNNEHQVVDL LLWIDLFGSK
VEARPEPVVI MAGGKGTRLD PFTKILPKPM LPLGDKPIVE VLMDRFYDQG FSQFILSVGY
KAEVVKLYFN DSNGRPYKVN FVQEEEPLGT AGALGLLRQQ LQGTFLVTNC DVIIEMNYGE
LLRYHHEKGN ALTIVGALRD FTIPYGVLRT EAGEFHQIEE KPSFHFLVNI GLYVLEPEVL
EGLDNSSFIH MTDLIMATKD KGLRVGVYPH HGRWFDIGQW DEYRQTLRAF EGLI