Gene Mlg_1472 details


Gene Information

Locus tag: Mlg_1472
Symbol:
ID: 4269264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1679762
End bp: 1682110
Gene Length: 2349 bp
Protein Length: 782 aa
Translation table: 11
GC content: 71%
IMG OID: 638126228
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_742311
Protein GI: 114320628
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID:
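
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 1679762
end_bp = 1682110
reported_gene_length = 2349      # bp
reported_protein_length = 782    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of amino acids, assuming one untranslated terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under those assumptions the computed values can be compared directly with the Gene Length and Protein Length fields.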


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.528687
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATA TCCCCCATCA GATTGCCGAA GAACTCGGCG TCCAGCCGCG CCAGGTGGAG 
GCCAGTGTGG CGCTCCTGGA TGGCGGCGCC ACGGTCCCGT TCATCGCCCG CTACCGCAAG
GAGGCCACCG GCGGCCTGGA CGACAGCCAG TTGCGCCGCC TGGAGGAGCG GTTGACCTAC
CTGCGCGAGC TGGCGAGCCG GCGGGAGACC GTCCTCAAGA CCATCCGGGA CCAGGGCAAG
CTTACCCCGC AACTGGAGAC GGACATCCGC GCCGCCGACA GCAAGACCCG GCTGGAGGAC
CTCTACCGCC CCTATCAGCC CAAGCGTCGC ACCAAGGCCC AGATCGCCCG GGAGCTCGGC
CTGGAGCCGC TGGCCGACAG CCTGCTGGAG GACCCCTCGC AGGCGCCGGA GGCGCTGGCC
GCGGACTATC TGGTGCCCGC GGGCCCGGAC GGCAAAGGCG GCGTCCCCGA TGCCCAGGCG
GCACTGGACG GCGCCCGTCA GATCCTGATG GAGCGCTTCG CCGAGGATGC CGAGCTGGTC
GGCCGGCTGC GCGAGTGGGC CTGGGGCAAC GCCCGGCTGC GCAGCCGGGT TATCGAGGGC
AAGGAGCGGG AGGGCGCCAA GTTCCGTGAC TACTTCGAGC ACGAGGAGCC GCTGGCCCGC
ATCCCCTCCC ACCGTGCGCT GGCCCTGTTC CGCGGCCAGA GCGAGGATAT CCTGCAATTG
AGCCTGATCT GGGAAGGGGA GGACGGCGAG AGCCCCACCG CGGAGGGCAT GATCGCGCGC
CGCTTCGGTA TCCGTCATGG GGACCGTCCC GCCGACGACT GGCTGGCCCG CACCGTGCGC
ATGAGCTGGC GCGTCAAGCT GTCGCTGCAG CTGGAACTGG CCCTGAAGCG CCGGCTGCGC
GAGGCCGCCG AGGAGGAGGC CATCCGCGTC TTCGGGCGTA ACCTCAACGA CCTGCTGCTG
GCCGCCCCGG CCGGTGCGGT CGCCACCATG GGGCTGGACC CGGGCATCCG CACCGGGGTC
AAGGTGGCCG TGGTCGACGG CACCGGCAAG GTGGTAGACA CCGCGACGGT GCACCCCTTC
CCCCCGCGCA ATGACCGCAA CGGTGCCCTG GCCACCCTCG CCGGGCTGGC CAAGCGGCAC
GCCGTCCGCC TGGTGGCCAT TGGCAATGGC ACCCATTCGC GCGAGACCGA TGCCCTGGTG
GCGGAACTGA TGCAGGCCAT GCCGGAACTC CCACTCACCC GGGTCACCGT CTCCGAGGCC
GGGGCCTCGG TCTACTCGGC GTCTGAGTAC GCCAGCCGTG AACTGCCGGA ACTGGACGTG
GCCCTGCGCG GGGCGGTCTC CATCGCCCGG CGCCTGCAGG ACCCGCTGGC CGAGCTGGTG
AAGATCGAGC CCAAGGCCAT CGGCGTCGGC CAGTACCAGC ACGATGTCAG CCAGGTGAAG
CTGGCGCGCA CCCTGGACAA CGTGGTGGAG GACTGCGTGA ACGCCGTGGG CGTGGACGTG
AACACCGCCT CCGCCCCGCT GCTCGCCCGG GTCGCCGGCC TCACCCCCGC GCTGGCCGAG
AACGTGGTCA GCCACCGCGA CCGGCACGGG CCCTTTCCCG ACCGGAAGAC CCTGCTCGAG
GTGCAGCGCC TGGGCCCCAA GGCCTACGAG CAGTGCGCCG GTTTCCTTCG GATCCGCGGT
GGCCGCAATC CCCTGGATGC CAGCGCCGTC CACCCGGAGG CCTATCCGGT GGTCGAGCGG
ATCCTCAAGG ACAGCGGCAA GTCCATTGAC CAGCTCATCG GCGACAGCCC ATTCCTGCAG
CGGCTCGAAC CGGCCCGCTA CACCGATGAC CAGTTCGGCG AGCCGACGGT GCGCGACATC
CTCGCCGAGT TGGACAAACC CGGTCGTGAC CCGCGCCCGG AGTTTCGCAC CGCCCGCTTC
CGCGAGGGGG TGAACACCCT GCAGGACCTG GCGCCGGGCA TGGTGCTGGA GGGAACGGTC
ACCAACGTGA CCAACTTCGG CGCCTTCGTC GATATCGGTG TCCACCAGGA CGGTCTGGTG
CACATCTCGG CGCTGGCCGA GCGCTTCGTG CGCGACCCGC ACGAGGTGGT CAAGGCCGGC
GACATCGTCA AGGTGAAGGT CATGGAGGTG GACCTCGAGC GCAAGCGCAT CGGCCTGTCC
ATGCGCCTGG ATGACGAACC CGGAGAGGGC CGGTCGCGCC GCGGCAGCAA GCCGGCCGAT
GGCGCCGCGG CCCGGCGGGG TAAGGGCGGC AAGGACAACC GGCCCCAGGG GGGGCGCGGC
GCCCGTCGCG AGGAGAAGCC CATGGGTGCC ATGGCCGAGG CACTGGCGGC CCTGAAAAAG
GGGCAATGA
 
Protein sequence
MTDIPHQIAE ELGVQPRQVE ASVALLDGGA TVPFIARYRK EATGGLDDSQ LRRLEERLTY 
LRELASRRET VLKTIRDQGK LTPQLETDIR AADSKTRLED LYRPYQPKRR TKAQIARELG
LEPLADSLLE DPSQAPEALA ADYLVPAGPD GKGGVPDAQA ALDGARQILM ERFAEDAELV
GRLREWAWGN ARLRSRVIEG KEREGAKFRD YFEHEEPLAR IPSHRALALF RGQSEDILQL
SLIWEGEDGE SPTAEGMIAR RFGIRHGDRP ADDWLARTVR MSWRVKLSLQ LELALKRRLR
EAAEEEAIRV FGRNLNDLLL AAPAGAVATM GLDPGIRTGV KVAVVDGTGK VVDTATVHPF
PPRNDRNGAL ATLAGLAKRH AVRLVAIGNG THSRETDALV AELMQAMPEL PLTRVTVSEA
GASVYSASEY ASRELPELDV ALRGAVSIAR RLQDPLAELV KIEPKAIGVG QYQHDVSQVK
LARTLDNVVE DCVNAVGVDV NTASAPLLAR VAGLTPALAE NVVSHRDRHG PFPDRKTLLE
VQRLGPKAYE QCAGFLRIRG GRNPLDASAV HPEAYPVVER ILKDSGKSID QLIGDSPFLQ
RLEPARYTDD QFGEPTVRDI LAELDKPGRD PRPEFRTARF REGVNTLQDL APGMVLEGTV
TNVTNFGAFV DIGVHQDGLV HISALAERFV RDPHEVVKAG DIVKVKVMEV DLERKRIGLS
MRLDDEPGEG RSRRGSKPAD GAAARRGKGG KDNRPQGGRG ARREEKPMGA MAEALAALKK
GQ
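
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any length or composition check. The following is a minimal sketch of that step. The helpers `clean_sequence` and `gc_fraction` are hypothetical names, and only the first line of the gene sequence is used as input, so the result is not expected to match the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (blocks of ten bases).
first_line = "ATGACCGATA TCCCCCATCA GATTGCCGAA GAACTCGGCG TCCAGCCGCG CCAGGTGGAG"

cleaned = clean_sequence(first_line)
print(len(cleaned))
print(round(gc_fraction(cleaned), 3))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` would allow the length and GC content to be compared with the Gene Length and GC content fields of the record.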