Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0483 |
Symbol | |
ID | 4268351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 530140 |
End bp | 531135 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638125223 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_741327 |
Protein GI | 114319644 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGGA CCTTCAGGGA TTTCCTCAAG CCGCGCACCG TCGACATCCA GGAACAGGGT GAGCGACGCG CGAAGATTGT GCTGGAGCCC CTCGAGCGGG GCTTCGGCCA CACGCTGGGC AACGCCCTGC GCCGCGTGTT GCTGTCCTCC ATGCCGGGCA GCGCCGTTGT CCAGGCAGAG ATCGAGGGTG TCGAGCACGA GTACAGCAGC ATGGAGGGGG TTCAGGAAGA TGTTGTCGAC ATCCTGCTGA ACCTCAAGAG CCTGGCCGTG CGGATGCACG ATCGGGACGA GGCCGAGCTC ACGGTGTCGG TGCAGGGGCC TGGGCCGGTC ACCGCGGGCG ATATCCAGAC CGCCCACGAT GTCGAGGTCA AGAACCCGGA GCTCCTCATT TGCACGCTCA CCAAGGCGGT GGCCTTCAAC GCCAAGCTGA TGGTCGCCCG CGGGCGGGGC TACGAGGCCG CGACCCAGCG TGATGGGGAC GAGGACCGGG TCATCGGGCG CCTGCAGCTC GACGCCAGCT ACAGCCCGGT GAAGCGGGTG GCCTACACGG TGGAGAGCGC CCGCGTTGAG CAGCGGACCA ACCTGGACAA GCTGGTCCTC GATGTGGAGA CCAACGGTGT GCTGGAGCCG GAGGAGGCGG TGCGTTTCGC CGCCGGCCTG CTGCGCGATC AGCTCTCGGT GTTCGTGGAC CTGGAAGGCG GCGAGTTCGA GGCCGAGCAG GAGGAGCAGG AGCCCGACGT GGATCCGATC CTGCTGCGTC CGATCGATGA GCTGGAGCTG ACCGTCCGGT CCGCCAACTG CCTCAAGGCC GAGAGCATCC ACTACGTGGG TGACCTGGTG CAGCGCACTG AGGTCGAGCT GTTGAAGACG CCGAATCTGG GCAAGAAGTC CCTGACCGAA ATCAAGGAGA CACTGGCCTC CCACGGCCTG TCCCTTGGTA TGAGGCTGGA AAACTGGCCG CCGGCCGGTC TGGGCGAAGA TCGCGTCGTG GGCTGA
|
Protein sequence | MQGTFRDFLK PRTVDIQEQG ERRAKIVLEP LERGFGHTLG NALRRVLLSS MPGSAVVQAE IEGVEHEYSS MEGVQEDVVD ILLNLKSLAV RMHDRDEAEL TVSVQGPGPV TAGDIQTAHD VEVKNPELLI CTLTKAVAFN AKLMVARGRG YEAATQRDGD EDRVIGRLQL DASYSPVKRV AYTVESARVE QRTNLDKLVL DVETNGVLEP EEAVRFAAGL LRDQLSVFVD LEGGEFEAEQ EEQEPDVDPI LLRPIDELEL TVRSANCLKA ESIHYVGDLV QRTEVELLKT PNLGKKSLTE IKETLASHGL SLGMRLENWP PAGLGEDRVV G
|
| |