Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2468 |
Symbol | |
ID | 4270209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2804709 |
End bp | 2805824 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638127226 |
Product | BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_743298 |
Protein GI | 114321615 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000564935 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTCG AATTGTTATT GACAGCCTTC CTCAGGGACA GCGCCCCGGA TGCGCTGGCC ACGGAGGCCA GCCCGGTCCG GCCCCAACCG GTTCGGTCGG CCCTCATCCG TGTGCTCGGA GTCCTGGGCC GCGCGGTGCT GGCCGTCGCG CCCTGGCTGT TCATCGCCGG CCTGCTTTGG GCGGCCATCT TCGTGCGGCC TCAGCCGCTG GGCTCCACGG TGCAGCCGCC CCTGATCGAG GAGCGGGACG CCTTCTTCGG CGCGGCCCTG CCGGCCCCGG GTGTGGCCTG GATCGTGGGC AGCGACGGCA AGATCCTGCG CTCCGAGGGG GGCCTCGACA ACTGGCATCG CCAGCAGGCC GGCACCCAGG AGCACCTGCA GCACATCGCC GCTTGGTCGG GCGATGAGGC CGTCGCCGTC GGCAACGACG GCGTGGTGCT GTACACCCGG GACGGTGGTG AGACCTGGGC GGTGGGCGAT GCCCCCCGCT CCGAGATCGC CAACAAGCTG CTGCGGGTCC GCACCGGGGC GGCGGGTGAG GCCTGGGCGG TCGGTGAGAT GGGGGCCCTG CTCCGGACCG GCGATGGCGG TGCCACCTGG TCCCGGGCGA TGCCGGAGGA GGATCTGGCC TGGGCCGACC TCTCCTTCAA TGGCGCCGGC GTGGGCGTGC TCGTGGGTGA GTTCGGTGAG ATGCGCCGGA GCACCGACGG GGGCGCATCG TGGGAAGCGC TCCCCCCTGT GGTCGACAGC AGCCTGACCG CCATCGCCTT TGCCGATGAC GGCCGGGGCG TGGCCGTGGG TCTGGAGGGC GTGATCCTCA CCAGCACCGA TCACGGCGCC ACCTGGACAG CGGCGGACAG CCCCACCGAG TTGCACCTGT TTGACGTGAG CTGGGACCCC GAGGCCGGGC ATTGGCTGGC GGTGGGGGAC CAGGGAGCCT GGGTGACCGG CCGGGTCGGG GGCGACTGGG CGAGCGGCCG GATCAGCGAG AACAGCATGC CCTGGCTGAT GGACGCTCAG CCGGTCGGCG GCGCGGTGCT GATCGTCGGG GCGCAGGCGG GTCTCTGGGA GGGACCGGGT GGCGGCTGGC GGCCATTCAC GACCAACGGG GAGTAA
|
Protein sequence | MRFELLLTAF LRDSAPDALA TEASPVRPQP VRSALIRVLG VLGRAVLAVA PWLFIAGLLW AAIFVRPQPL GSTVQPPLIE ERDAFFGAAL PAPGVAWIVG SDGKILRSEG GLDNWHRQQA GTQEHLQHIA AWSGDEAVAV GNDGVVLYTR DGGETWAVGD APRSEIANKL LRVRTGAAGE AWAVGEMGAL LRTGDGGATW SRAMPEEDLA WADLSFNGAG VGVLVGEFGE MRRSTDGGAS WEALPPVVDS SLTAIAFADD GRGVAVGLEG VILTSTDHGA TWTAADSPTE LHLFDVSWDP EAGHWLAVGD QGAWVTGRVG GDWASGRISE NSMPWLMDAQ PVGGAVLIVG AQAGLWEGPG GGWRPFTTNG E
|
| |