Gene Mlg_0620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0620 
Symbol 
ID4270602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp670333 
End bp671757 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content72% 
IMG OID638125367 
ProductFmu (Sun) domain-containing protein 
Protein accessionYP_741464 
Protein GI114319781 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00109548 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTGCTG TCCGTCCCGA CAACCCGCTG TTCCGCTACC GCGATGTGGT GGACGACTGG 
CCGGCCTTCG AGACGGCGGT GGCCAGTCCG CTGCCGGCGA CCATCTGGGC CCACCCGGCG
CGGATCACCC GCGAGGCCCT GCGCTCGCTG CTGGCTGAGG CGGGTATCGA TGCCCGGCCG
GTGGCCTGGC AGCCCTCGGC CCTGCGTCTG CCCGCCGGGC TGCGTCCGGG GGCGCATTGG
GGGCAGGCCG CCGGGCTTTA CCATGCGCAG GAGGAGGCCA GCATGGTGCC GGTGACGCTG
CTGGATCCGC AGCCCGGTCA CCGGGTGCTC GACCTGTGCG CCGCCCCCGG CAACAAGACC
GCCCAGGCGG CACTGGCGCT GGGAAACCGT GGCACCGTGG TGGCTAACGA TGTGGCCAAG
GGGCGGCTGG CCGCCATCCG TCATCTGATC AAGCGGCTGG GGCTGATGAA CGTCTCGGTC
ACCTGTCGGC CGGCACAGGA CTATTCCCCC CACGCGGGCG GTTTCGACCG GGTCATCGCC
GACGTGCCCT GTAGCTGCGA GGGCACGGTG CGCAAGTCCA GCCAGCCCTC CGCCCAGGCC
CTGTGCGAGA CCCGTGAGCG GCTTGTCGCG CGGCAGACGG CCATTCTCGA CAAGGCGGTG
CGGCTCTGCC GCCCGGGCGG GCGTATCGTC TACTCCACCT GCACCTTTGC CCCGGAGGAG
AACGAACAGG TGGTGGATGC CCTGCTGCGC CGCTACCCGG ACGAACTGCG CTTGCTGCCG
GTGAGCCTGC CCGGGCTGTA CTTGGCACCG GGGCTCACCG GGTGGCGGGG CCGGACCTTC
CTGCCGGAGC TGGCCCAGGC GGTCCGGCTC TGGCCCCACC ACAACAACAC CGGGGGGTTT
TTCATCGCCC TGCTGGAGCG GGTGGGCGGC GAGTGCCGCG AGGGGGCGCG GGCGGCAGAA
CGGCCGACGG ACGGTCACTG GCTGGAGGGT ATTGTCCAGC GCTACGCCAT TCCCGGGTCG
GCCCTGGCCG GGTTGCGGCT GGTCCACCGT GGCCGCAAAT ACGCCCAGCT CATTGCCGCG
GACCACGAGC CGCCGGCCCG GCCGGAGCGG GTGTTCTTCG GTCTGCCGAC CGTCGGGGTG
CAGATGAAGC CGCCCAAACT CACCACCGCC GGGGTGATGG CGCTGGGCGC TTACGCCCGG
CGCAACGTGG TGGAACTGGA CGAAAGCCAG GCGCAGGCCT ATACCCGCCG CCAGGTGGTG
ACGCTGCAAC CGGGGCAGGG CATCGGGCTG GGGGAGAGCG GTTCGGTGGT GGTCCGGCGC
CGGGGCTATG GGCTGGGCCT GGGCGTGGTC AGCCCGGCGG CGGAGGATGG CAGCCGGCAC
CTGGCCAGCC TCTACCCGCG AAGCTGGGCC GCTGAGGTGC CCTGA
 
Protein sequence
MGAVRPDNPL FRYRDVVDDW PAFETAVASP LPATIWAHPA RITREALRSL LAEAGIDARP 
VAWQPSALRL PAGLRPGAHW GQAAGLYHAQ EEASMVPVTL LDPQPGHRVL DLCAAPGNKT
AQAALALGNR GTVVANDVAK GRLAAIRHLI KRLGLMNVSV TCRPAQDYSP HAGGFDRVIA
DVPCSCEGTV RKSSQPSAQA LCETRERLVA RQTAILDKAV RLCRPGGRIV YSTCTFAPEE
NEQVVDALLR RYPDELRLLP VSLPGLYLAP GLTGWRGRTF LPELAQAVRL WPHHNNTGGF
FIALLERVGG ECREGARAAE RPTDGHWLEG IVQRYAIPGS ALAGLRLVHR GRKYAQLIAA
DHEPPARPER VFFGLPTVGV QMKPPKLTTA GVMALGAYAR RNVVELDESQ AQAYTRRQVV
TLQPGQGIGL GESGSVVVRR RGYGLGLGVV SPAAEDGSRH LASLYPRSWA AEVP