Gene Moth_2431 details

Gene Information

Locus tag: Moth_2431
Symbol:
ID: 3831661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2552098
End bp: 2553039
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 52%
IMG OID: 637830350
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_431256
Protein GI: 83591247
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
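
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2552098, 2553039)
print(length)                     # 942
print(protein_length_aa(length))  # 313
```

Both values agree with the Gene Length and Protein Length fields in this record.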


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGAAA TTGAAAAGCC TAAGATAGAG TGCCAGCATC TGGACGACAA ATACGGGCGC 
TTTGTTGTGG AGCCCTTGGA ACGTGGCTAT GGGATTACCC TGGGCAACTC CCTGCGACGG
ATGCTCCTGT CATCCCTGCC GGGAGCGGCC GTTACATCGG TGAAGATCGA AGGGGTTCTC
CATGAGTTTT CAACAATCCC GGGGGTCGTT GAGGATACGA CTGATATTAT CCTGAACATC
AAGTCCCTGG CCTTGAAACT CCACAGCGAT GAACCTAGGG TAATCCGTAT TGAAGCAGAC
GATGAAGGGG TAGTCACTGC CGGTGATATT ATTACCGGCG CCGACGTTGA GATCTTAAAC
CCCGAACAGG TTATCGCCAC CGTCGAAAAG GGAGGCCGCC TTTACATGGA AATGACGGTG
GAGAAGGGGC GGGGTTACGT CAGCGCCGAT AAGAATAAGA AAGAAGACCA GCCAATCGGG
ATTATACCCG TAGATTCCCT GTTTTCACCG ATTCACAAGG TGAATTACAC AGTGGAAAAC
ACCCGGGTAG GCCAGATTAC TGATTACGAT AAGCTCACCC TGGAAGTCTG GACCGATGGC
AGTATTGCCC CCGATGAAGC AGTGAGCTCG GCTGCCAAAA TACTCATAGA GCACATGCGC
CTTTTTTTAG GGTTGACGGA GCGGGTTAGT GATGAAGTCA CCATGGTGGA AAAGGAAGAA
GAGACCAGGG ATCGACTCAT GGATATGTCC ATCGAGGAAC TCGACCTGTC AGTACGCTCC
TACAACTGCT TGAAGCGGGC CGGCATTAAC ACCGTAGCCG AGCTGTTGCA GCGTTCTGAA
GAGGATATGA TGAAGGTCCG TAACCTGGGC AAGAAGTCCC TGGAAGAGGT TACCCAGAAG
CTGAGTGAAC TGGGCCTGAG CCTGCGTTCA AGTGAAGAGT AA
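
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to strip the spacing and compute GC content as (G + C) / total length. The helper names are hypothetical, and how this page rounds its reported GC content is an assumption not confirmed by the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above: six blocks of 10 bases.
first_line = ("ATGCTGGAAA TTGAAAAGCC TAAGATAGAG "
              "TGCCAGCATC TGGACGACAA ATACGGGCGC")
cleaned = clean_sequence(first_line)
print(len(cleaned))  # 60
print(round(gc_percent(cleaned), 1))
```

Passing the full 942 bp sequence through the same two functions gives a figure to compare against the GC content field.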
 
Protein sequence
MLEIEKPKIE CQHLDDKYGR FVVEPLERGY GITLGNSLRR MLLSSLPGAA VTSVKIEGVL 
HEFSTIPGVV EDTTDIILNI KSLALKLHSD EPRVIRIEAD DEGVVTAGDI ITGADVEILN
PEQVIATVEK GGRLYMEMTV EKGRGYVSAD KNKKEDQPIG IIPVDSLFSP IHKVNYTVEN
TRVGQITDYD KLTLEVWTDG SIAPDEAVSS AAKILIEHMR LFLGLTERVS DEVTMVEKEE
ETRDRLMDMS IEELDLSVRS YNCLKRAGIN TVAELLQRSE EDMMKVRNLG KKSLEEVTQK
LSELGLSLRS SEE