Gene Information |
Locus tag | Mlg_2516 |
Symbol | |
ID | 4268771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2859217 |
End bp | 2861013 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638127275 |
Product | RNA polymerase, sigma 70 subunit, RpoD |
Protein accession | YP_743346 |
Protein GI | 114321663 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
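The coordinate and length fields above agree with each other, and the relationship can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the stop codon; both assumptions are inferred from the listed numbers rather than stated in the record. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2859217
END_BP = 2861013


def gene_length(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1797, matches Gene Length
print(protein_length(length_bp))  # 598, matches Protein Length
```

The strand field ("-") does not affect these lengths; it only indicates that the coding sequence lies on the reverse strand of the replicon.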
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.329688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGG ATCAGCAGTC TCAGATCAAG CAGCTCATCG CCAAGGGCAA GGAGCAGGGG TTCCTCACCT ATGCCGAGGT GAACGACCAC CTGCCCGATG ACATCGTCGA TCCCGAGCAG ATCGAAGACA TCATTGGCAT GATCAACGAC ATGGGTATCA ATGTCCATGA GGTCGCGCCG GATGCCGATG AGTTGCTGCT CAGTGATGCT GCGGTCAGTA CCGACGAGGA CGAGGCCGAA GAGGCGGCGG CGGCGCTGGC GGCGGTGGAC GCCGAGTTTG GCCGCACCAC CGACCCGGTG CGCATGTACA TGCGCGAGAT GGGTACGGTG GAGCTGCTGA CCCGTGAGGG CGAGATCGAG CTGGCCAAGC GCATCGAAGA GGGGTTGGAC CAGGTGCTGG CCGCGCTCTC CGCCTATCCC GGTGCGGCGG CCAAGCTGCT CGACCTGTAC CGCAAGGTGC AGGACGGTGA GATGCGCCTG AACGAACTGA TGGGCGGCTT CCGCAACCCC GACGAGGAAC TCTCCGCCTA CTCCGCCGAC GCCGGCAAGG AGGAGGACGA GGACGGCGAG GAGGCCGTGG TCGACAACGG GCCCGACCCG GAGATGGCCG CTGAGCTGTT CCGGCGGCTG GCGGAGGCGG ATCAGCGCAT GCAGGAGGTC CTGCGGCGAC AGGGCTCCGA CAGCCCCGAG TGCGCGGAAC TGCGTGAGCA GCTGGCGGAG ATCTTCCTGA CCTTCAAGTT CCCGCCGAAG ATCATCGACC AGCTGGTGGA TGGGCTGCGC CGGGATGTCA ACGTGCTGCG CCGCAACGAG CGCCACATCC TCAAGGCCTG CACCAAGGCG GGCATGCCGC GCAAGACCTT CGTGCGCAGC TTCATCTCCC GCGAGACCGA TCCGGGCTGG CTGGACGAGA TGCTGGCCAG CAAAGAGCCG TGGGCCCAAC GCCTGGCCGA GCACGAGGAC GATATCCGCC GTGCCCAGCA GGTGCTGATC GACGCCGAGC GTAAGGTGGG TATGACCATC GGCGAGATCA AGGACATCAA CCGGCGCATG TCCATCGGCG AGGCCAAGGC CCGCCGGGCG AAGAAGGAGA TGGTGGAGGC CAACCTGCGG CTGGTCATCT CCATCGCTAA GAAGTACACC AACCGCGGCC TGCAGTTCCT GGACCTCATC CAGGAGGGCA ACATCGGCCT GATGAAGGCG GTGGACAAGT TCGAATACCG CCGGGGTTAC AAGTTCTCCA CCTACGCCAC CTGGTGGATC CGGCAGGCCA TCACCCGCTC CATCGCCGAC CAGGCCCGCA CCATCCGCAT CCCGGTGCAC ATGATCGAGA CCATCAACAA GCTCAACCGG GTCTCTCGGC AGATGCTCCA GGAGATGGGC CGGGAGCCGA GCCCGGACGA GCTGGCTGAG CGCATGGAGA TGCCCGAGGA CAAGGTGCGC AAGGTGCTCA AGATCGCCAA GGAGCCGATC TCCATGGAGA CGCCCATTGG CGACGACGAG GACTCCCACC TGGGGGATTT CATCGAGGAC ATCAACGCCA TGTCCCCGGT GGATTCCGCC ACCCGGGAGG GGCTGCGCGA ATCGGTCAAG GGCGTGCTCT CCGGCCTGAC CCCCCGGGAG GCCAAGGTGC TGCGCATGCG CTTTGGCATC GACATGAACA CCGACCACAC CCTGGAAGAG GTCGGCAAGC AGTTTGACGT CACCCGCGAG CGCATCCGCC AGATCGAGGC CAAGGCCCTG CGCAAGCTAC GCCACCCGAC CCGCTCCGAG GGTCTGCGCA GTTTCCTCGA CGAGTAA
|
Protein sequence | MTQDQQSQIK QLIAKGKEQG FLTYAEVNDH LPDDIVDPEQ IEDIIGMIND MGINVHEVAP DADELLLSDA AVSTDEDEAE EAAAALAAVD AEFGRTTDPV RMYMREMGTV ELLTREGEIE LAKRIEEGLD QVLAALSAYP GAAAKLLDLY RKVQDGEMRL NELMGGFRNP DEELSAYSAD AGKEEDEDGE EAVVDNGPDP EMAAELFRRL AEADQRMQEV LRRQGSDSPE CAELREQLAE IFLTFKFPPK IIDQLVDGLR RDVNVLRRNE RHILKACTKA GMPRKTFVRS FISRETDPGW LDEMLASKEP WAQRLAEHED DIRRAQQVLI DAERKVGMTI GEIKDINRRM SIGEAKARRA KKEMVEANLR LVISIAKKYT NRGLQFLDLI QEGNIGLMKA VDKFEYRRGY KFSTYATWWI RQAITRSIAD QARTIRIPVH MIETINKLNR VSRQMLQEMG REPSPDELAE RMEMPEDKVR KVLKIAKEPI SMETPIGDDE DSHLGDFIED INAMSPVDSA TREGLRESVK GVLSGLTPRE AKVLRMRFGI DMNTDHTLEE VGKQFDVTRE RIRQIEAKAL RKLRHPTRSE GLRSFLDE
|
| |
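The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the GC content field could be recomputed from the Gene sequence field, the snippet below strips the block spacing and counts G and C bases. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short string in the usage lines is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence, for illustration only.
first_blocks = "ATGACACAGG ATCAGCAGTC"
print(clean_sequence(first_blocks))
print(gc_percent(first_blocks))
```

Passing the full Gene sequence field to `gc_percent` and rounding the result would give a value to compare against the GC content row.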