Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1208 |
Symbol | |
ID | 4270696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1407958 |
End bp | 1408938 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638125957 |
Product | hypothetical protein |
Protein accession | YP_742047 |
Protein GI | 114320364 |
COG category | [S] Function unknown |
COG ID | [COG3272] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.569232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.186411 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGG CCAGCTTTAT GCCGCCCCGT GACCACACCA AGATCCCCGT TGGGATCAGC TCCTGCCTGC TGGGGGAGCC GGTGCGCTAT GATCGGGGCC ACAAGCGCAG CGCTTTCATC ACTGATATCC TGGCCGCGTA TTTCGACTTC CGCCCCGTCT GTCCGGAGGT GGCAATCGGG CTGGGTGTGC CACGGCCACC CATTCGGCTC GCCGGTGACC CGGAGCGACC GCGGGCGGTC GGGGTGCGGG ATCCGCGCGT GGACGTCACA GCGCCGTTGC GCGCCTACGG CGAGCGCATG GCCGGTGAGC TCGATTACAT CAGTGGCTAC ATCTTCAAGA GCAAGTCCCC CAGTTGCGGG CTGTTCCGGG TGAAGGTCTA CGGCGAGTCG GGCAGAGCGC CCTCCGCGAC CGGTCGCGGG CTCTACGCCC AGGCCATCAC CGAGGCCAAT CCACTACTAC CCGTGGAGGA GGAGGGCCGG CTTAACGACC CGGTCCTGCG CGAGAACTTC ATCGGCCGGG TCTACGCCTA CCACCGCTGG CAGACCCTCC TGGCCGAGGG GGTGACGGCG GAACGGTTGG TCGACTTTCA CACCGCGCAC AAGCTCATCC TGATGGCACA CGGCCGTCAG GGCCTGCGGG AGATGGGACG ACTGGTGGCC CAGGCCGGCC ACGGAAATCT GGAGCGGATC ACGGAGGCCT ACGGGGCGTC GTTCATGCGC ACGCTGAGGC ACAAGGCCAC CCGGCGGCGG CACACCGATG TGCTCTTCCA CCTGCTGGGC TACCTCAAGC GCACCCTGGA GAGCGACGAG AAGGCCGAGG CGGTGGAGCT GATCCACGAC TACCGGGAGG GGCGGGTGCC GCTGATCGTG CCTGTGACCC TGCTGCGGCA CCACTTTCGC AAGCATCCCG AGCCCTACGT CGAGCGACAG CTCTATCTGC ACTGGCAGCC GGCGGGCCTG GGGCTGTGGA ACGCCATCTA A
|
Protein sequence | MSQASFMPPR DHTKIPVGIS SCLLGEPVRY DRGHKRSAFI TDILAAYFDF RPVCPEVAIG LGVPRPPIRL AGDPERPRAV GVRDPRVDVT APLRAYGERM AGELDYISGY IFKSKSPSCG LFRVKVYGES GRAPSATGRG LYAQAITEAN PLLPVEEEGR LNDPVLRENF IGRVYAYHRW QTLLAEGVTA ERLVDFHTAH KLILMAHGRQ GLREMGRLVA QAGHGNLERI TEAYGASFMR TLRHKATRRR HTDVLFHLLG YLKRTLESDE KAEAVELIHD YREGRVPLIV PVTLLRHHFR KHPEPYVERQ LYLHWQPAGL GLWNAI
|
| |