Gene Mlg_0265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0265 
Symbol 
ID4270483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp300745 
End bp301806 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID638124990 
ProductSel1 domain-containing protein 
Protein accessionYP_741110 
Protein GI114319427 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCGAC AAGAGCCAGT TTGCTCAGGG TTAACAAAGG TCTTGGCAAT GCCCCGCAAA 
AAAATTCTAG TGTTCATCCT CATCGTTGGC TTATGGGCCG GCACGGCTCA ACCCAACGAT
GGAGAAGACA AAACATCCCT GAAGCTCAAT GCCGAGCAGC AACAGGCTAA AGAAGAAGGA
ATGCGCCTGT GGGGCCTGCA TGAATGGATC GACATGCAGC CGCCGCTGGA AGAAGCGGCC
GGGGCCGGTG ATGTCGAGGC CATCTACTAC CTGGGCGAGG CGAACCGGCT ACTGGATCGC
GGCATGTCCC GCGAGGCCAT CGACTGGTAC CACCGCGCGG CGCAGGGCGG GGATCCCCAT
GCCATGCTCC GGCTCGAACA CGGCATGATC TGCAAGTTGG CTGACATTTG CCCCGAGAAA
TATGAAGCGT GGGTGGACAA GGCTCTCGAG CAGGAACTAC CCAAGGCCGA ACATGGTGAC
CCGATCGCCA TGTCGACCCT ATTTGATGTC TACAACATGC TCGGGGAACC CCGCACGGCC
CTAGACTGGC TGGAACGTGC CGCCGAGGCC GGAAACCCGG AGGCCCAAGA TTGGCTGGGA
ACTATCACCC AGGAACGCTC CGGCGAATGG CCCCCGCAGC TGAAAGACGT CGAAGCCGCC
GAGCCCTGGT TCCGCAAGGC CGCCGAGCAG GGCTATGCCC CGGCCATATA CAACCTCGTG
GGGAATCTAA TTCGGCAAGA AAAAATGGAA GCGGCGTGGA ACTGGGTCGT TGAAGGTTCG
GAGCGTGGGC ATATTCGGAA GCGCATTACC TACGGATTTT GTCACCTCGC CCCAGGGGAG
TTGATTGATT ACTGCTACCC GGACGAACCC GATCCCGTCA AAGGGTGGGC CATATTGCAC
GCGCTGTATG AAGAAACACG AGCTAGCACG GCCGAGAGCC TTCTGGAGCG ATACGGGGAG
CGCCTATCCG ACGAAGAAAT CGCCGAAGCC GAAGAACTCG CCGAGGACTG GCTGAACCGC
GAGCCCCCAC TGTCCTACTT CCCGCCCAAG TACGGCCTGT AG
 
Protein sequence
MRRQEPVCSG LTKVLAMPRK KILVFILIVG LWAGTAQPND GEDKTSLKLN AEQQQAKEEG 
MRLWGLHEWI DMQPPLEEAA GAGDVEAIYY LGEANRLLDR GMSREAIDWY HRAAQGGDPH
AMLRLEHGMI CKLADICPEK YEAWVDKALE QELPKAEHGD PIAMSTLFDV YNMLGEPRTA
LDWLERAAEA GNPEAQDWLG TITQERSGEW PPQLKDVEAA EPWFRKAAEQ GYAPAIYNLV
GNLIRQEKME AAWNWVVEGS ERGHIRKRIT YGFCHLAPGE LIDYCYPDEP DPVKGWAILH
ALYEETRAST AESLLERYGE RLSDEEIAEA EELAEDWLNR EPPLSYFPPK YGL