Gene Information |
Locus tag | Mlg_1086 |
Symbol | |
ID | 4270031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1264706 |
End bp | 1266277 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125838 |
Product | extracellular solute-binding protein |
Protein accession | YP_741928 |
Protein GI | 114320245 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
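
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon (the record states neither convention explicitly); the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1264706
end_bp = 1266277

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame ending in a stop codon,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under those assumptions the result agrees with the listed Gene Length (1572 bp) and Protein Length (523 aa).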
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGCA AACCGGCCAC TGCGTTATCC ACCATCGACC TGCGCCGCCG GCGATTGCTC CAGACGCTGG GCATCGGCGG GGCCGCTGCC GGGCTCGGCG GGCTCCCCTG GCTGACCGCA CCCGGCGCCC TGGCCGGCGA ACCCCGCCAC GGGGGGACCC TGCGAGTGGC CCTGCCCCAG GCCGCCACCA TCAATCCGCT GCGCATGAGC GCCGGCGGTG CCATCGCCGT GGTGCAGCAG GTGGCCGAAT ACCTGGTGCG CGTGGACGAG CACCTGAACC TGCAGCCGGC CCTGGCCACG GACTGGGAGA CGCCCGACGA GGGGCGGACC TGGGTGTTCC GCCTGCGCGA GGGGGTGAAG TTCAACGACG GCCGGCCGCT GGAGGCCGAC GACGTGGTGG CCAGCTTCCG GCGCCTGGTG GATCCGGACG TCGCCAGTGT GGCCCGCTCA CAGTTGGACT TTCTGCCGCC CGAGGGCATC AGCGCCGACG ACCGGCACAC CGTCCGCTTC GAGTTATCGC GGCCCATCGG CGCGTTCCCC TACTACACCC AGATCTACAA CGCCGTGATC CTTCCCCGCG ACCACGACGG CGATCTCGCC AGTAACCCCG TCGGCACCGG GCCCTTCCTA CTGACCCACT ACCAGGCCCG CGAAGAGGCC GTGCTGGAGC GCAACCCCGA CTATTGGGAC GCGCCACGGC CCTACCTGGA CCGGGTGCGC ATGGCGATGT ACGACGGTGA TCAGCCCCAG GTGCTGGCCC TGCGCGGCAA CGCCGCCGAC ATGATGCTGT TCGCCAGCTA TATGAACGCC CGCCCGCTGC TGGACAACGA GGAGATCAAC CTGCTCAGCG CCCGCAGCAC CCAGCACCGC CAACTGGCCA TGCGCTGCGA CCAGCCCCCT TTCGAGGACG TCCGCCTCCG CCGCGCCGTG GCCCTGGCCC TGGGCCGGGA GTCCATGGTG AACAACCTCA TTGGCGGCTA CGGCGAACCG GGCAACGACC ACCCGGTGGC GCCCCTCTAC CCGGACGCGG CCGATATCGA CCTGGCCCGG CGCGAGCCCG ACCTGGAACG GGCGCAGGCG CTGCTGGCCG AGGCCGGCGT GCCCAATGGC TTCCAGATCG ACCTCCACAT CGGTCGCCTG GCCGAACTGC CGCAGTACGG CGTCCTGGTC GAACGCATGC TGAACCCGAT CGGGATCCGC GTGAACCTGC GGGTGGAACC ACTGAACACC TACTACGAAC ACTGGACCAA GGTGAACTTC GGCCTCACCG ACTGGGTGGG CCGGGCGGTT CCGGAGCAGA TCCTGGCGGC GGCCTTCCGG GGGGGCGCCG ACTGGAACGC CCCCCATTGG CGCGATGATG AGTTCGACGC CGCCCTTAGC GAGCTGGAGG CGGCCACCGA TCCGGCGCGC CGCCGGCAGC TCACGGCCAC CCTGGCCGGG AAGCTGCACG AGGAGGTGCC GGCGGCCATC ACTTACTTCA CCCAGGCCCT GCGCCCGGTG CGTCGGCGGG TGCAGGGTCT CCGCGCGGAC AATGCCAACT ACCTGGACCT GACCCGGGCC TGGCTGGCCT AG
|
Protein sequence | MRRKPATALS TIDLRRRRLL QTLGIGGAAA GLGGLPWLTA PGALAGEPRH GGTLRVALPQ AATINPLRMS AGGAIAVVQQ VAEYLVRVDE HLNLQPALAT DWETPDEGRT WVFRLREGVK FNDGRPLEAD DVVASFRRLV DPDVASVARS QLDFLPPEGI SADDRHTVRF ELSRPIGAFP YYTQIYNAVI LPRDHDGDLA SNPVGTGPFL LTHYQAREEA VLERNPDYWD APRPYLDRVR MAMYDGDQPQ VLALRGNAAD MMLFASYMNA RPLLDNEEIN LLSARSTQHR QLAMRCDQPP FEDVRLRRAV ALALGRESMV NNLIGGYGEP GNDHPVAPLY PDAADIDLAR REPDLERAQA LLAEAGVPNG FQIDLHIGRL AELPQYGVLV ERMLNPIGIR VNLRVEPLNT YYEHWTKVNF GLTDWVGRAV PEQILAAAFR GGADWNAPHW RDDEFDAALS ELEAATDPAR RRQLTATLAG KLHEEVPAAI TYFTQALRPV RRRVQGLRAD NANYLDLTRA WLA
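
The sequences above are printed in space-separated blocks of 10 characters. As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, here is a minimal Python sketch. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool, and the sketch assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in permitted start codons). Only the first three blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGCGACGCA AACCGGCCAC TGCGTTATCC")
peptide = translate(prefix)
gc = gc_fraction(prefix)
print(peptide, gc)
```

The translated prefix matches the first block of the listed protein sequence (MRRKPATALS). Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field, although the exact rounding used by the source database is not stated.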