Gene Mlg_1825 details


Gene Information

Locus tag: Mlg_1825
Symbol:
ID: 4268180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2084825
End bp: 2085853
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 65%
IMG OID: 638126581
Product: RNA polymerase, sigma 28 subunit
Protein accession: YP_742659
Protein GI: 114320976
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02394] RNA polymerase sigma factor RpoS
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
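
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates copied from the record above.
length_bp = gene_length(2084825, 2085853)
print(length_bp, protein_length(length_bp))  # 1029 342
```

Both results agree with the Gene Length and Protein Length fields of the record.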


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGATC TCACAAGGCT TGATGCAGAG GAACCAGCAG TGGACATGGA TATAGTCGAT 
TCAGATGATA GGGACAACCG TAGTGAACCC GTGAGCCTGG ACGGGGATAC CCCCCAGGCC
CTTACCGGCG AGACCGAACC CGTTAACCGT CCAGAGCCGC GTCTGCCCCG CCGACGGACG
CAATCGGCCT ACCAGGCCTC CCTGGACGCC ACCCAGATTT ACCTCAATGA AATCGGCTAC
TCCCCGCTGC TCTCCGCGGA GGAGGAGGTC TATTTCTCCC GCCGGGCCCA GCGTGGCGAC
GCCGCCGCCC GCGCCCGCAT GATCGAAAGC AACCTGCGGC TGGTGGTCAA GATCGCGCGT
CGCTATATGA ACCGCGGTCT GGCCTTTCTC GACCTTATTG AGGAGGGCAA CCTGGGGCTC
ATCCGGGCCG TCGAGAAGTT CGATCCCGAG CGGGGCTTCC GGTTCTCCAC CTACGCCACC
TGGTGGATCC GGCAGACCAT CGAGCGGGCC ATCATGAACC AGACCCGCAC CATCCGGTTG
CCCATTCACG TCATCAAGGA GATCAACCAG TATCTGCGGG CGGCCCGCAA GCTGACCCAG
GAGCTGGACC ACGAGCCCTC GGTGGAGGAG ATCGCCGACC ATATGGGCCG TGATGTGGAG
GACGTGCGCC GGATGCGGGG GCTCAATGAG GGCACCACCT CGGTGGACGT GCCCATCGGC
CGTGATGCGG ATCGGGTGCT GCTCGACGCC ATCCCGGATG AAAACAACGT CGACCCGGTC
TCCGCGCTTC AGGACGGCGA TGTCTTTGGC AACTTGGAGG CCTGGCTGGG TGAGCTGACC
GAGAAACAGC GGGCGGTGGT GGAGCGCCGC TTCGGTCTCA ACGGCCATGA TCGGGCGACG
CTGGAGCAGG TCGGCAATGA GATCGGCGTG ACCCGTGAGC GGGTACGCCA GATCCAGATC
GACGCCCTGC GCCGGCTGCG CGAGGTCATG GAGCGGGAGG GGTTCTCCCA GGACGCCGTG
TTCGGCTGA
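
The gene sequence above is printed in blocks of 10 bases, so it has to be joined into one string before any analysis. As a minimal sketch, the helpers below strip the whitespace and compute the GC fraction, which is the kind of value the GC content field reports. The function names are hypothetical, and the database's exact rounding is not known here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, used here only as sample input.
first_line = "ATGGCGGATC TCACAAGGCT TGATGCAGAG GAACCAGCAG TGGACATGGA TATAGTCGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence instead of the first line gives the whole-gene GC fraction for comparison with the reported value.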
 
Protein sequence
MADLTRLDAE EPAVDMDIVD SDDRDNRSEP VSLDGDTPQA LTGETEPVNR PEPRLPRRRT 
QSAYQASLDA TQIYLNEIGY SPLLSAEEEV YFSRRAQRGD AAARARMIES NLRLVVKIAR
RYMNRGLAFL DLIEEGNLGL IRAVEKFDPE RGFRFSTYAT WWIRQTIERA IMNQTRTIRL
PIHVIKEINQ YLRAARKLTQ ELDHEPSVEE IADHMGRDVE DVRRMRGLNE GTTSVDVPIG
RDADRVLLDA IPDENNVDPV SALQDGDVFG NLEAWLGELT EKQRAVVERR FGLNGHDRAT
LEQVGNEIGV TRERVRQIQI DALRRLREVM EREGFSQDAV FG
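
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the standard codon table, which translation table 11 shares for elongation codons (table 11 differs only in the set of permitted start codons). The function name `translate` is hypothetical. Because this gene starts with ATG, no special handling of alternative start codons is included.

```python
BASES = "TCAG"
# Standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
print(translate("ATGGCGGATC TCACAAGGCT TGATGCAGAG"))  # MADLTRLDAE
print(translate("TTCGGCTGA"))  # FG
```

These two results match the first ten and last two residues of the protein sequence in the record.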