Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1194 |
Symbol | |
ID | 4270329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1394611 |
End bp | 1395600 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125943 |
Product | periplasmic solute binding protein |
Protein accession | YP_742033 |
Protein GI | 114320350 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4531] ABC-type Zn2+ transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.170646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGCA AACTGTTCGG CCTGCTTGGC TGTGCCCTTC TGGGGCTCAC GACAAGCATC GGTGCCCTGG CGCAGCCGCC AAAGGTGGTC GCCAGCCTGT TGCCGCTGCA CAGCCTGACG GCGTCGGTCA TGGACGGCGT GGCCGAGCCG CAACTGCTGC TCCCCGGTGG CGCCTCGCCA CATACCTACA GCCTGCGGCC GTCGGAGGCA GAGCATCTGC GTCACGCCGA GCTGGTCATC TGGGTCGGCC CGGAACTGGA GCGGTTCCTG GAGCGGCCGC TGCGCAACCT GGCCGGTGAT GCGGAGAAGA TGACGCTGCT GGCGCTGGAC GGTATCCCCC TGCACCCGAT CCGCGAGGGG GGGCTGTGGG ACCCGCACCA CCACGACGAT CACAGCCACG AACCCCATGG CCACGGCCAT AGTGACCACC ACCATGACCA TGGTCATGGC CAGCCCCACG ACCACCACGG GGACTACGAC ACTCACCTGT GGCTTTCCCC GGCCTTCGCG CGCCAGTTTA TCGACACGCT GGCGGAGAGG CTGGCGGTCC TCGACCCGGA CAACGGCGCG GCCTATCGCG CCAACGCCGC CGCCACCCTG GAGCGGATCG AGCGGTTGGA TGCGTCACTG CATGAACGCC TGGCGTCGCT GGCTGAGCTT CCCTATCTGG TCTTCCACGA CGCCTACCAA TACTTCGAGG CGCATTACGG CCTCAGTCCG GTGGGTTCGG TGACGGTGAG CCCGGAGCGC CAGCCCAGCG CCCGCCGGCT GTCAGAACTC CGCCAGCGCA TCCGTGATAC CGGTGCCCGC TGCGTCTTCG CCGAACCACA GTTCCGCCCG AGCCTGGTGA CCACCCTGGT GGAAGGCACC GGGGCCGAGG CCGGCGTCCT CGACCCGTTG GGCGCAACCC TGGAACCCGG CCCGGACGCC TGGTTCGAGC TGATGGAGCG CCTGGCCGAC GACCTGGCGG CCTGTCTGGC CCGGGCCTGA
|
Protein sequence | MKSKLFGLLG CALLGLTTSI GALAQPPKVV ASLLPLHSLT ASVMDGVAEP QLLLPGGASP HTYSLRPSEA EHLRHAELVI WVGPELERFL ERPLRNLAGD AEKMTLLALD GIPLHPIREG GLWDPHHHDD HSHEPHGHGH SDHHHDHGHG QPHDHHGDYD THLWLSPAFA RQFIDTLAER LAVLDPDNGA AYRANAAATL ERIERLDASL HERLASLAEL PYLVFHDAYQ YFEAHYGLSP VGSVTVSPER QPSARRLSEL RQRIRDTGAR CVFAEPQFRP SLVTTLVEGT GAEAGVLDPL GATLEPGPDA WFELMERLAD DLAACLARA
|
| |