Gene Msil_2991 details

Gene Information

Locus tag: Msil_2991
Symbol:
ID: 7093486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3302545
End bp: 3303561
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 61%
IMG OID: 643466302
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_002363264
Protein GI: 217979117
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
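
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3302545
END_BP = 3303561


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1017
print(protein_length(length_bp))  # 338
```

Under these assumptions the computed values agree with the listed gene length of 1017 bp and protein length of 338 aa.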


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.52801
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTCAAA AAAACTGGCA AGAGCTCACC AAACCCAACA AGCTTGAAGT CATTTCGGGC 
GACGATCCGA AGCGTTTTGC GACCATCGTC GCCGGGCCGC TCGAACTCGG CTTCGGGCTC
ACGCTTGGCA ATTCCCTGCG CCGCATCCTC CTGTCGTCGT TGCAGGGCGC GGCGATCACC
TCCGTCCATA TCGACGGCGT GCTGCATGAA TTTTCGTCGA TTCCCGGCGT GCGCGAGGAC
GTGACCGACA TTGTCCTCAA CATCAAGGAC ATCGCGATCA AGATGCCGGG CGACGGGCCG
AAGCGGCTCG TGCTCAAGAA GCAGGGGCCC GGCAAGGTCA CCGCCGGCGA CATCCAGACC
AGCGGCGACA TTTCGATCCT GAACCCCGGC CTAGTGATCT GCACCCTCGA CGAGGGCGCC
GAGATCCGCA TGGAGTTCAC GGTCCACACC GGCAAGGGCT ATGTCGCAGC GGACCGCAAC
CGGGCCGAGG ACGCGCCGAT CGGCCTCATT CCGATCGACA GCCTCTATTC GCCGGTAAAG
AAGGTCAGCT ACCGCGTCGA AAACACGCGC GAGGGTCAGA ACCTCGACCT CGACAAGCTG
ACGCTGCAGG TCGAGACCAA TGGCGCGCTG ACGCCGGAAG ACGCCGTCGC CTTCGCCGCC
CGCATCCTGC AGGATCAGCT CAACGTCTTC GTCAATTTCG AGGAGCCGCG CCGCGTCGAG
GCGACGCCGT CGATCCCGGA GCTCGCCTTC AATCCGGCGC TCTTGAAAAA GGTCGACGAG
CTCGAGCTTT CGGTGCGTTC GGCGAACTGC CTGAAGAACG ACAATATCGT CTATATAGGC
GACCTCATCC AGAAGAGCGA AGGCGAGATG CTGCGCACGC CGAATTTCGG CCGCAAATCC
TTGAACGAAA TCAAGGAAGT GCTCGCGCAG ATGGGCCTGC ACCTCGGCAT GGAGGTGAAT
GGCTGGCCGC CGGACAATAT CGACGACCTC GCCAAGCGCT TCGAGGAGCA TTACTGA
 
Protein sequence
MIQKNWQELT KPNKLEVISG DDPKRFATIV AGPLELGFGL TLGNSLRRIL LSSLQGAAIT 
SVHIDGVLHE FSSIPGVRED VTDIVLNIKD IAIKMPGDGP KRLVLKKQGP GKVTAGDIQT
SGDISILNPG LVICTLDEGA EIRMEFTVHT GKGYVAADRN RAEDAPIGLI PIDSLYSPVK
KVSYRVENTR EGQNLDLDKL TLQVETNGAL TPEDAVAFAA RILQDQLNVF VNFEEPRRVE
ATPSIPELAF NPALLKKVDE LELSVRSANC LKNDNIVYIG DLIQKSEGEM LRTPNFGRKS
LNEIKEVLAQ MGLHLGMEVN GWPPDNIDDL AKRFEEHY
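
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can serve as an alternative start codon that is read as methionine at the initiation position. The following is a minimal sketch of that translation step. The `START_CODONS` set is an assumption covering only the common bacterial start codons (the NCBI table lists additional, rarer ones), and `translate_cds` is a hypothetical helper, not the tool used to produce the record.

```python
# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments as the standard code;
# it differs in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the common start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An initiator codon in first position is read as methionine.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGATTCAAA AAAACTGGCA AGAGCTCACC"))  # MIQKNWQELT
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should give the complete protein, provided the sequence is copied without line breaks being lost or altered.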