Gene Mlg_2230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2230 
Symbol 
ID4269445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2532778 
End bp2534295 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content66% 
IMG OID638126986 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_743062 
Protein GI114321379 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAACGG TCATGAAACA ATCGCTGCAA CTTCGACTCG GGCAGCAACT CACTATGACG 
CCACAGCTGC AACAGGCCAT CCGCCTGCTG CAGCTCTCCA CCCTGGATCT GCAGATGGAG
ATCCAGCAGC AGCTCGAGTC CAACGTCATG CTGGAACTCG CCGAAGAGGA CCAGCAGGAG
ACCGAACAGC GCACCGAAGA GGGCGAGACC CAGGACACGG AGGGCGAGCG CGGGGACGAG
GCCAGCACCG AGGAGGCCAC CGAGGCCGGG ACCGCGGGCG AGGAGACCGC CGCCCAGGAG
CAGGCCGACA TCCCCGAGGA CCTGCCGCTG GACTCCAATT GGGATGACAT CTACGACGGC
AGCACCCCCT GGAGCCAGCC GGACAGCGAG GACGAGGACC GCGACCCCTA CGCCAACCGC
TCCGGTGGCG GTGAAACCCT GCACGACCAC CTGACCTGGC AGGCGGAGCT CACCCCCTTC
ACCGATCGGG ACGCGGCCAT CGCACAGGTC ATCATCGACT CCGTGCGCGA CGACGGCTAC
CTGGGCGCCG GCATCGAGGA GTTGATCTCC GCCCTGCCGG CGGAGTGGGC CGTGGAGCCC
GATGAGGTCG AGGCGGTGCT CCGGCGCATC CAGCACTTCG ACCCGGTGGG CGTGGCCGCC
CGCGATCCGC GTGAGGCGCT GCTCATCCAG CTCGAGCAGC TGCCGCCGGA CACCCCACTG
CTGCCCGAGG CCCGGCGCCT GGTGGATCTG CACCTGGACA TGCTGGTACA GCGCCAGTAC
GCCCAACTCT GTCGGCGCAT GAAGCTCAAT CAGGACCAGC TTCGCGAGGT CCTGGGCCTG
ATCCAGACCC TCGATCCGCG GCCGGGCTCG CAAATCGGTG GCGACGAGAC CCAGTACGTG
GTCCCCGACG TGGTGGTCCG GCGCAGCGAC GGCCGCTGGC AGGTGGAGCT CAATCCCGCG
ACCGCCCCCC GGCTGCGGGT CAACAGCTAT TACGCCAGCC TGATCAAACG CGCCGACAAC
AGCAGCGACA ACACCACCCT GCGCAACCAC CTGCAGGAGG CGCGCTGGTT CATCAAGAGC
CTGCTTAGCC GCAACGACAC CCTGCTCAAG GTCGCCCGCT GCATCGTCGA GCGCCAGCAG
GGCTACTTCG ACCACGGCGA AGAGGCCATG CAGCCGTTGG TCCTGCGCGA GGTGGCCGAG
GCGGTGGACA TGCACGAGTC CACCATCTCC CGGATCACCA CCCGCAAGTA CATGCACACC
CCGCGGGGCA CCCTGGAGTT CAAGTACTTC TTCTCCAGCC ATGTGCAGAC GGTGGACGGC
GGCGAGTGCT CCGCCACCGC CATCCGTGCC CGCATCCGGC GTCTGATCGC CGATGAGAAC
CCCACCAAAC CACTCAGTGA CAGTCGTATT GCCAATATCC TCCAGGAGGA GGGCATAAAC
GTGGCAAGAC GGACCGTAGC CAAGTATCGT GAGGCTATGG CCATCGCGTC CTCGTCAGAG
CGCAAGCGAC TGGCCTGA
 
Protein sequence
MATVMKQSLQ LRLGQQLTMT PQLQQAIRLL QLSTLDLQME IQQQLESNVM LELAEEDQQE 
TEQRTEEGET QDTEGERGDE ASTEEATEAG TAGEETAAQE QADIPEDLPL DSNWDDIYDG
STPWSQPDSE DEDRDPYANR SGGGETLHDH LTWQAELTPF TDRDAAIAQV IIDSVRDDGY
LGAGIEELIS ALPAEWAVEP DEVEAVLRRI QHFDPVGVAA RDPREALLIQ LEQLPPDTPL
LPEARRLVDL HLDMLVQRQY AQLCRRMKLN QDQLREVLGL IQTLDPRPGS QIGGDETQYV
VPDVVVRRSD GRWQVELNPA TAPRLRVNSY YASLIKRADN SSDNTTLRNH LQEARWFIKS
LLSRNDTLLK VARCIVERQQ GYFDHGEEAM QPLVLREVAE AVDMHESTIS RITTRKYMHT
PRGTLEFKYF FSSHVQTVDG GECSATAIRA RIRRLIADEN PTKPLSDSRI ANILQEEGIN
VARRTVAKYR EAMAIASSSE RKRLA