Gene Mlg_1341 details


Gene Information

Locus tag: Mlg_1341
Symbol:
ID: 4270014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1539880
End bp: 1540899
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 69%
IMG OID: 638126094
Product: sigma E regulatory protein, MucB/RseB
Protein accession: YP_742180
Protein GI: 114320497
COG category: [T] Signal transduction mechanisms
COG ID: [COG3026] Negative regulator of sigma E activity
TIGRFAM ID:
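
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_len = gene_length_bp(1539880, 1540899)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # prints "1020 339"
```

Both results match the Gene Length and Protein Length fields listed in the record.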


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.12457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCGC CCGCGACTAG CCTGCCCGGT TCGCGCGACT GGTGCCGCGG ACTCGCGGTG 
CTGATGCTGG CGCTGATCTC GGTCGGGGCC TGGGCCGACC AGGGTGACAA CCCCGGGCTG
CGCCTGCTGG AACGGATCGG CGAGCAAACC CCTCAACTGC ATTACCACGG TATCCTGGTC
TATCGCCATG GCGGCGACAT GGAGACCCTC CGCATCATCC ACCGGGGCGG CGCTGAGCAC
GAGCGCAGCG AGCGTTTCTA TACCCTCACC GGCATCCCCC GCGAGGTCAT TCGCAAGCCC
GACGAGGTCA TCTGTATCCT GCCCGACGCC GAAGCCGTGG TGGTGGGCCG GCGACAGCTG
CGCAATCCGA TTGCCCAGGC CCTGCCCCGG TATACCGAGG CGCTGCAGGA GGCCTATGAG
GTCACCCTGG CCGGCGAGGG TCGGGTGGCG GACCGGGATG CGCAACAGGT GCTGATCGTG
CCCCGCGACG ACCTGCGCTA CGGCCACCGG CTCTGGATTG ACGAGGCCTA CGGCCTGTTG
CTGCGCGCCG ATCTGCTGGA CGAGCACCAG CAGGTGCTGG AGCAGGTCAT GTTCACCGAG
GTCACCGTGG TGGAGGCGGT GCCGGATGCC TGGCTGGAGC CGGGGATCAG TGGTGAGAGC
TTCACCTGGG TGAGGCCGGC GGACCGAGCG GATGCCGCCC CGGAGCAGCG CCGTTGGCAG
GTCGCCGAGG TGCCGCCCGG CTTTCGCCTA ATCTCGCACC GCCAACGGCA GATCGCCGGT
CACGACCCCC CCGTGGAGCA CCTCCACTAC AGTGACGGTC TGGCCTCGGT ATCGGTCTAT
GTCTCCCCGC AGGCGGCCGA CAAGGTCCGG GAGCGGGCGG CCAGAATGGG GTTGATGGGC
GCGGTGCGCG TGCCCCGGGA CGGTTTCACC GTCACCGTGG TCGGCGAGGT GCCGCGTGCC
ACGCTGCACC TGTTCGCCGA GCGGCTGGCG GCCACCGGGG ATGAGGGGGC TCGACCGTGA
 
Protein sequence
MKAPATSLPG SRDWCRGLAV LMLALISVGA WADQGDNPGL RLLERIGEQT PQLHYHGILV 
YRHGGDMETL RIIHRGGAEH ERSERFYTLT GIPREVIRKP DEVICILPDA EAVVVGRRQL
RNPIAQALPR YTEALQEAYE VTLAGEGRVA DRDAQQVLIV PRDDLRYGHR LWIDEAYGLL
LRADLLDEHQ QVLEQVMFTE VTVVEAVPDA WLEPGISGES FTWVRPADRA DAAPEQRRWQ
VAEVPPGFRL ISHRQRQIAG HDPPVEHLHY SDGLASVSVY VSPQAADKVR ERAARMGLMG
AVRVPRDGFT VTVVGEVPRA TLHLFAERLA ATGDEGARP