Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2991 |
Symbol | |
ID | 7093486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3302545 |
End bp | 3303561 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643466302 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_002363264 |
Protein GI | 217979117 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.52801 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTCAAA AAAACTGGCA AGAGCTCACC AAACCCAACA AGCTTGAAGT CATTTCGGGC GACGATCCGA AGCGTTTTGC GACCATCGTC GCCGGGCCGC TCGAACTCGG CTTCGGGCTC ACGCTTGGCA ATTCCCTGCG CCGCATCCTC CTGTCGTCGT TGCAGGGCGC GGCGATCACC TCCGTCCATA TCGACGGCGT GCTGCATGAA TTTTCGTCGA TTCCCGGCGT GCGCGAGGAC GTGACCGACA TTGTCCTCAA CATCAAGGAC ATCGCGATCA AGATGCCGGG CGACGGGCCG AAGCGGCTCG TGCTCAAGAA GCAGGGGCCC GGCAAGGTCA CCGCCGGCGA CATCCAGACC AGCGGCGACA TTTCGATCCT GAACCCCGGC CTAGTGATCT GCACCCTCGA CGAGGGCGCC GAGATCCGCA TGGAGTTCAC GGTCCACACC GGCAAGGGCT ATGTCGCAGC GGACCGCAAC CGGGCCGAGG ACGCGCCGAT CGGCCTCATT CCGATCGACA GCCTCTATTC GCCGGTAAAG AAGGTCAGCT ACCGCGTCGA AAACACGCGC GAGGGTCAGA ACCTCGACCT CGACAAGCTG ACGCTGCAGG TCGAGACCAA TGGCGCGCTG ACGCCGGAAG ACGCCGTCGC CTTCGCCGCC CGCATCCTGC AGGATCAGCT CAACGTCTTC GTCAATTTCG AGGAGCCGCG CCGCGTCGAG GCGACGCCGT CGATCCCGGA GCTCGCCTTC AATCCGGCGC TCTTGAAAAA GGTCGACGAG CTCGAGCTTT CGGTGCGTTC GGCGAACTGC CTGAAGAACG ACAATATCGT CTATATAGGC GACCTCATCC AGAAGAGCGA AGGCGAGATG CTGCGCACGC CGAATTTCGG CCGCAAATCC TTGAACGAAA TCAAGGAAGT GCTCGCGCAG ATGGGCCTGC ACCTCGGCAT GGAGGTGAAT GGCTGGCCGC CGGACAATAT CGACGACCTC GCCAAGCGCT TCGAGGAGCA TTACTGA
|
Protein sequence | MIQKNWQELT KPNKLEVISG DDPKRFATIV AGPLELGFGL TLGNSLRRIL LSSLQGAAIT SVHIDGVLHE FSSIPGVRED VTDIVLNIKD IAIKMPGDGP KRLVLKKQGP GKVTAGDIQT SGDISILNPG LVICTLDEGA EIRMEFTVHT GKGYVAADRN RAEDAPIGLI PIDSLYSPVK KVSYRVENTR EGQNLDLDKL TLQVETNGAL TPEDAVAFAA RILQDQLNVF VNFEEPRRVE ATPSIPELAF NPALLKKVDE LELSVRSANC LKNDNIVYIG DLIQKSEGEM LRTPNFGRKS LNEIKEVLAQ MGLHLGMEVN GWPPDNIDDL AKRFEEHY
|
| |