Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2731 |
Symbol | |
ID | 4270985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 3099903 |
End bp | 3100919 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638127493 |
Product | extracellular solute-binding protein |
Protein accession | YP_743561 |
Protein GI | 114321878 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.476445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.605201 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACTA CTGATCTGAT CAAGGCCGGC CGCCTGGCCG CCACCGCCGC CCTCTGCGGT GCCACGCTGT TCGCCGGTAG CGCGAACGCC GAGCGGCTGC ACGAGATCCG CGACCGCGGC ACACTGACCG TTGCCCTGTA CAACGACTTC GGGCCCTACT CCTGCGTCGG CACCGGCGGC GAGCTGATCG GGGTTGACGT GGCCCTGGCC CGCGCCCTGG GCGAGAAGCT GGACCTGAAG GTGGAGCTGG CCGGCTTCGG CGCCCAGGAC AGCATGGACC AGGACCTGGC CCTGCTGCAG GACGAACAGG CCGAGGACGA GTGGGACGAG GCCCTGCTCG AGCGCGCCCC CGACCTGATG ATGCACGTAC CGGTGGACCC CGTCTTCCAG GAGCGCAACG CCGATTACGA CATCATGGGC GCCTACTTCC ACGAGGCCAT GGCGGTGCTC TACGACCGCG AGGAAATCGG CGATCTGGGG TACTCGGTGA ACACCCCGGA CCCGTTCGAT GGCCTGCGCG TGGGTGTGGA GATGTACACC TACTCCTACA CCATGCTCAC CAACGGCTTT GACGGCCGCC TGCGCTCCGG CGTGGTCAAC CACAAGAACG TGCCCGAAGC AGTGGAGGCC CTGCTTGCCG GTGAGACCTC CGCGGTTTTC GCACCGCGCG GCGAGCTTCA GAGCGCCCTG GCCGCCTTCC CCGAACCGCG CACCAGCCTC GCCCTGAGCG AGCTGCGCGA CCTGTTCCGC ACGGACCGGG TGCGTAGCGA CTGGGACGTG GGCATGGCCG TCAAGGCCGG CAACCCGGAG CTCTCCCGGG CAGTCGAGGA GGCCATGGCA CAACTGGTGG CCGACGGGAC CGTGGAGCGT ATCTTTAACG AATACGGCAT CGCCTGGGTG GGCCCGGGGG AGGAGTACCG CCTAGCCCGC AACGGCAATG GCCCCGCGGG CGTGACCCGC ACCGGCCTGG AACGCGCGCA GCTCTGCCGG GCAACAATGC CGGCCGGACT GTACTGA
|
Protein sequence | MNTTDLIKAG RLAATAALCG ATLFAGSANA ERLHEIRDRG TLTVALYNDF GPYSCVGTGG ELIGVDVALA RALGEKLDLK VELAGFGAQD SMDQDLALLQ DEQAEDEWDE ALLERAPDLM MHVPVDPVFQ ERNADYDIMG AYFHEAMAVL YDREEIGDLG YSVNTPDPFD GLRVGVEMYT YSYTMLTNGF DGRLRSGVVN HKNVPEAVEA LLAGETSAVF APRGELQSAL AAFPEPRTSL ALSELRDLFR TDRVRSDWDV GMAVKAGNPE LSRAVEEAMA QLVADGTVER IFNEYGIAWV GPGEEYRLAR NGNGPAGVTR TGLERAQLCR ATMPAGLY
|
| |