Gene Mlg_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0054 
Symbol 
ID4270923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp57738 
End bp59315 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content71% 
IMG OID638124779 
Productsigma-54 dependent trancsriptional regulator 
Protein accessionYP_740901 
Protein GI114319218 
COG category[T] Signal transduction mechanisms
[K] Transcription 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGCAG CGGGCCGGAG CGTGACAGAC GGGGATCACG GGCAGGCCCT CGGCGCGCTC 
GCAGGGCTTG ACGCCGCCCT ACCGCTGGTG GAGAGCGAGG CCCCCGCGCA GCTCCGCCGG
CGCGCGTGCC GATTGCTTGC CCGCGCCCGG GGCCTGCCGC AGGTAGACTG CCTGGCGCTG
GATACCGATG GCCGGAGGCT GTCTGCAGGT GCCGGCAATG CGGCGTATCG GTGCGACGAC
TTCCGCCATC CCTACGCCCA CGTGATCCGC AGTGGGGCCC CCCTGCAGGC GTGCATCGGC
ACCCTGCGCG CCCGAATGGA TCATCCCGAC TTTCAGGCCG GGATGGCCGG CCTGAAAGGG
GACTGGGAAC TCCTCTGCCG CCCCTTGTGC GACCCGGATG CCCCGCGCGG CTGGCTGGGG
GTCTGGGCGC TGGTCGGTCC GCGGGAGGTG GTGGCTGCCC TGGACGGCGA TCCCGGCTTC
ATGGCCCTGG AAGGGCTCCT CTGCCGGTTG TGGTGCCGGT TGCTGACAGC GGCGAAGGCG
CACCGCCAGG GCCAGGATCT GCGCCACTCC CTGCAACGGT TGGGGCGGGA CACGCGAGCG
CATGCGCTGT CCGGCGCCTT GAGCCGCGAA TTGATTGGCA GCTCTCCGGC CATGTCCCGG
TTGCGCGATC AGGTGATCCG CGCGGCCGGC ACACGCCTGG CCGTCCTCCT GCAGGGCGAG
ACCGGGACCG GCAAGGAGCG TGTCGCCCGC GCGATTCACC GCCACTCGGC ACAGGGCGAT
GGGCCCTTCG TGGCGATCAA CTGTGCGGCG ATCCCGGAGA CGCTGCTGGA GTCCGAGCTC
TTCGGCCACG TGCGCGGCGC CCACTCCAGC GCCACCCGGG ATCGCACCGG TCTGCTGGCC
GCGGCCGACG GCGGCACCCT GTTCCTGGAC GAGATCGGCG ATATGCCGCC AGCCTTGCAG
GCCAAGCTGC TCCGGGTGCT GGAGAGTGGG TGGTACCGGC CGCTGGGTGG CGGACGCGAG
CGGCGTGCCG ACCTCCGGCT GGTGTCCGCC ACCCACCAAC CCTTGACGGC CCGCATCCGT
GAAGGCCGGT TTCGTGCCGA TCTCTATTAC CGACTGAACC AGTTTCCGCT TCGTCTGCCC
GCGCTGCGGG AACGCCGGGC GGACATCCCC GAACTGGCGG ATCATTTCGC GGCCGCCTAC
GCCGCCAGAG AGGGGCGCCC CCGGGCCAGC CTGAGTCCGA CGGCGCTGAC ACACCTGGCG
GCCCGGGACT TTCCCGGGAA CCTGCGGGAA CTGCGCAACC AGGTGGAGTA TGGCTGTGCC
ATGGCCCCGG CCGGTCAGCC CATCGGGCCG GCGGACCTGC CGCTGAGCGA GACCCGGGAA
CGGACGCCGG TGGAGGGCCT GCCGGGCCCG GCGTTCAATC TCCGGGAGGT GGTGCGTGAC
TACGAGGCCC GCCTCATCCG CGAAAAACTG CGCCAGTTCA ACGGCAACCG GGCCAAGACG
GCGGTCAGCC TGGGGCTCCC GAAGAGGACG CTGGCCCACA AGTGCCGGAA GCTGCAACTG
GATGAGGACC CGGCATGA
 
Protein sequence
MSAAGRSVTD GDHGQALGAL AGLDAALPLV ESEAPAQLRR RACRLLARAR GLPQVDCLAL 
DTDGRRLSAG AGNAAYRCDD FRHPYAHVIR SGAPLQACIG TLRARMDHPD FQAGMAGLKG
DWELLCRPLC DPDAPRGWLG VWALVGPREV VAALDGDPGF MALEGLLCRL WCRLLTAAKA
HRQGQDLRHS LQRLGRDTRA HALSGALSRE LIGSSPAMSR LRDQVIRAAG TRLAVLLQGE
TGTGKERVAR AIHRHSAQGD GPFVAINCAA IPETLLESEL FGHVRGAHSS ATRDRTGLLA
AADGGTLFLD EIGDMPPALQ AKLLRVLESG WYRPLGGGRE RRADLRLVSA THQPLTARIR
EGRFRADLYY RLNQFPLRLP ALRERRADIP ELADHFAAAY AAREGRPRAS LSPTALTHLA
ARDFPGNLRE LRNQVEYGCA MAPAGQPIGP ADLPLSETRE RTPVEGLPGP AFNLREVVRD
YEARLIREKL RQFNGNRAKT AVSLGLPKRT LAHKCRKLQL DEDPA