Gene Mlg_2468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2468 
Symbol 
ID4270209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2804709 
End bp2805824 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content72% 
IMG OID638127226 
ProductBNR repeat-containing glycosyl hydrolase 
Protein accessionYP_743298 
Protein GI114321615 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000564935 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTCG AATTGTTATT GACAGCCTTC CTCAGGGACA GCGCCCCGGA TGCGCTGGCC 
ACGGAGGCCA GCCCGGTCCG GCCCCAACCG GTTCGGTCGG CCCTCATCCG TGTGCTCGGA
GTCCTGGGCC GCGCGGTGCT GGCCGTCGCG CCCTGGCTGT TCATCGCCGG CCTGCTTTGG
GCGGCCATCT TCGTGCGGCC TCAGCCGCTG GGCTCCACGG TGCAGCCGCC CCTGATCGAG
GAGCGGGACG CCTTCTTCGG CGCGGCCCTG CCGGCCCCGG GTGTGGCCTG GATCGTGGGC
AGCGACGGCA AGATCCTGCG CTCCGAGGGG GGCCTCGACA ACTGGCATCG CCAGCAGGCC
GGCACCCAGG AGCACCTGCA GCACATCGCC GCTTGGTCGG GCGATGAGGC CGTCGCCGTC
GGCAACGACG GCGTGGTGCT GTACACCCGG GACGGTGGTG AGACCTGGGC GGTGGGCGAT
GCCCCCCGCT CCGAGATCGC CAACAAGCTG CTGCGGGTCC GCACCGGGGC GGCGGGTGAG
GCCTGGGCGG TCGGTGAGAT GGGGGCCCTG CTCCGGACCG GCGATGGCGG TGCCACCTGG
TCCCGGGCGA TGCCGGAGGA GGATCTGGCC TGGGCCGACC TCTCCTTCAA TGGCGCCGGC
GTGGGCGTGC TCGTGGGTGA GTTCGGTGAG ATGCGCCGGA GCACCGACGG GGGCGCATCG
TGGGAAGCGC TCCCCCCTGT GGTCGACAGC AGCCTGACCG CCATCGCCTT TGCCGATGAC
GGCCGGGGCG TGGCCGTGGG TCTGGAGGGC GTGATCCTCA CCAGCACCGA TCACGGCGCC
ACCTGGACAG CGGCGGACAG CCCCACCGAG TTGCACCTGT TTGACGTGAG CTGGGACCCC
GAGGCCGGGC ATTGGCTGGC GGTGGGGGAC CAGGGAGCCT GGGTGACCGG CCGGGTCGGG
GGCGACTGGG CGAGCGGCCG GATCAGCGAG AACAGCATGC CCTGGCTGAT GGACGCTCAG
CCGGTCGGCG GCGCGGTGCT GATCGTCGGG GCGCAGGCGG GTCTCTGGGA GGGACCGGGT
GGCGGCTGGC GGCCATTCAC GACCAACGGG GAGTAA
 
Protein sequence
MRFELLLTAF LRDSAPDALA TEASPVRPQP VRSALIRVLG VLGRAVLAVA PWLFIAGLLW 
AAIFVRPQPL GSTVQPPLIE ERDAFFGAAL PAPGVAWIVG SDGKILRSEG GLDNWHRQQA
GTQEHLQHIA AWSGDEAVAV GNDGVVLYTR DGGETWAVGD APRSEIANKL LRVRTGAAGE
AWAVGEMGAL LRTGDGGATW SRAMPEEDLA WADLSFNGAG VGVLVGEFGE MRRSTDGGAS
WEALPPVVDS SLTAIAFADD GRGVAVGLEG VILTSTDHGA TWTAADSPTE LHLFDVSWDP
EAGHWLAVGD QGAWVTGRVG GDWASGRISE NSMPWLMDAQ PVGGAVLIVG AQAGLWEGPG
GGWRPFTTNG E