Gene Mlg_2516 details


Gene Information

Locus tag: Mlg_2516
Symbol:
ID: 4268771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2859217
End bp: 2861013
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 65%
IMG OID: 638127275
Product: RNA polymerase, sigma 70 subunit, RpoD
Protein accession: YP_743346
Protein GI: 114321663
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
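
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names `span_length` and `coding_codons` are hypothetical helpers introduced here, not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 2859217
END_BP = 2861013
GENE_LENGTH_BP = 1797
PROTEIN_LENGTH_AA = 598


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record: 2861013 - 2859217 + 1 = 1797 bp,
# and 1797 / 3 = 599 codons, of which one is the stop codon.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length and protein length are mutually consistent.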


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.329688
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGG ATCAGCAGTC TCAGATCAAG CAGCTCATCG CCAAGGGCAA GGAGCAGGGG 
TTCCTCACCT ATGCCGAGGT GAACGACCAC CTGCCCGATG ACATCGTCGA TCCCGAGCAG
ATCGAAGACA TCATTGGCAT GATCAACGAC ATGGGTATCA ATGTCCATGA GGTCGCGCCG
GATGCCGATG AGTTGCTGCT CAGTGATGCT GCGGTCAGTA CCGACGAGGA CGAGGCCGAA
GAGGCGGCGG CGGCGCTGGC GGCGGTGGAC GCCGAGTTTG GCCGCACCAC CGACCCGGTG
CGCATGTACA TGCGCGAGAT GGGTACGGTG GAGCTGCTGA CCCGTGAGGG CGAGATCGAG
CTGGCCAAGC GCATCGAAGA GGGGTTGGAC CAGGTGCTGG CCGCGCTCTC CGCCTATCCC
GGTGCGGCGG CCAAGCTGCT CGACCTGTAC CGCAAGGTGC AGGACGGTGA GATGCGCCTG
AACGAACTGA TGGGCGGCTT CCGCAACCCC GACGAGGAAC TCTCCGCCTA CTCCGCCGAC
GCCGGCAAGG AGGAGGACGA GGACGGCGAG GAGGCCGTGG TCGACAACGG GCCCGACCCG
GAGATGGCCG CTGAGCTGTT CCGGCGGCTG GCGGAGGCGG ATCAGCGCAT GCAGGAGGTC
CTGCGGCGAC AGGGCTCCGA CAGCCCCGAG TGCGCGGAAC TGCGTGAGCA GCTGGCGGAG
ATCTTCCTGA CCTTCAAGTT CCCGCCGAAG ATCATCGACC AGCTGGTGGA TGGGCTGCGC
CGGGATGTCA ACGTGCTGCG CCGCAACGAG CGCCACATCC TCAAGGCCTG CACCAAGGCG
GGCATGCCGC GCAAGACCTT CGTGCGCAGC TTCATCTCCC GCGAGACCGA TCCGGGCTGG
CTGGACGAGA TGCTGGCCAG CAAAGAGCCG TGGGCCCAAC GCCTGGCCGA GCACGAGGAC
GATATCCGCC GTGCCCAGCA GGTGCTGATC GACGCCGAGC GTAAGGTGGG TATGACCATC
GGCGAGATCA AGGACATCAA CCGGCGCATG TCCATCGGCG AGGCCAAGGC CCGCCGGGCG
AAGAAGGAGA TGGTGGAGGC CAACCTGCGG CTGGTCATCT CCATCGCTAA GAAGTACACC
AACCGCGGCC TGCAGTTCCT GGACCTCATC CAGGAGGGCA ACATCGGCCT GATGAAGGCG
GTGGACAAGT TCGAATACCG CCGGGGTTAC AAGTTCTCCA CCTACGCCAC CTGGTGGATC
CGGCAGGCCA TCACCCGCTC CATCGCCGAC CAGGCCCGCA CCATCCGCAT CCCGGTGCAC
ATGATCGAGA CCATCAACAA GCTCAACCGG GTCTCTCGGC AGATGCTCCA GGAGATGGGC
CGGGAGCCGA GCCCGGACGA GCTGGCTGAG CGCATGGAGA TGCCCGAGGA CAAGGTGCGC
AAGGTGCTCA AGATCGCCAA GGAGCCGATC TCCATGGAGA CGCCCATTGG CGACGACGAG
GACTCCCACC TGGGGGATTT CATCGAGGAC ATCAACGCCA TGTCCCCGGT GGATTCCGCC
ACCCGGGAGG GGCTGCGCGA ATCGGTCAAG GGCGTGCTCT CCGGCCTGAC CCCCCGGGAG
GCCAAGGTGC TGCGCATGCG CTTTGGCATC GACATGAACA CCGACCACAC CCTGGAAGAG
GTCGGCAAGC AGTTTGACGT CACCCGCGAG CGCATCCGCC AGATCGAGGC CAAGGCCCTG
CGCAAGCTAC GCCACCCGAC CCGCTCCGAG GGTCTGCGCA GTTTCCTCGA CGAGTAA
 
Protein sequence
MTQDQQSQIK QLIAKGKEQG FLTYAEVNDH LPDDIVDPEQ IEDIIGMIND MGINVHEVAP 
DADELLLSDA AVSTDEDEAE EAAAALAAVD AEFGRTTDPV RMYMREMGTV ELLTREGEIE
LAKRIEEGLD QVLAALSAYP GAAAKLLDLY RKVQDGEMRL NELMGGFRNP DEELSAYSAD
AGKEEDEDGE EAVVDNGPDP EMAAELFRRL AEADQRMQEV LRRQGSDSPE CAELREQLAE
IFLTFKFPPK IIDQLVDGLR RDVNVLRRNE RHILKACTKA GMPRKTFVRS FISRETDPGW
LDEMLASKEP WAQRLAEHED DIRRAQQVLI DAERKVGMTI GEIKDINRRM SIGEAKARRA
KKEMVEANLR LVISIAKKYT NRGLQFLDLI QEGNIGLMKA VDKFEYRRGY KFSTYATWWI
RQAITRSIAD QARTIRIPVH MIETINKLNR VSRQMLQEMG REPSPDELAE RMEMPEDKVR
KVLKIAKEPI SMETPIGDDE DSHLGDFIED INAMSPVDSA TREGLRESVK GVLSGLTPRE
AKVLRMRFGI DMNTDHTLEE VGKQFDVTRE RIRQIEAKAL RKLRHPTRSE GLRSFLDE
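
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC fraction could be computed for comparison with the reported GC content; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a sequence printed in spaced blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return gc / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACACAGG ATCAGCAGTC TCAGATCAAG CAGCTCATCG CCAAGGGCAA GGAGCAGGGG"
seq = join_blocks(first_line)

print(len(seq))                       # six blocks of ten bases
print(f"{gc_fraction(seq):.1%}")      # GC fraction of this fragment only
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field; a single line is not expected to match the gene-wide figure.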