Gene Information |
Locus tag | Mlg_0101 |
Symbol | |
ID | 4268839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 110648 |
End bp | 111748 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638124827 |
Product | hypothetical protein |
Protein accession | YP_740948 |
Protein GI | 114319265 |
COG category | |
COG ID | |
TIGRFAM ID | |
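
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the gene length includes a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mlg_0101.
length_bp = gene_length(110648, 111748)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1101 366
```

Under these assumptions the computed values agree with the listed gene length (1101 bp) and protein length (366 aa).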
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.867237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCCGC GCCCGCTGGA GCACCTGCTG ACCGCACTGA CCGACCCCGC CTCCGTCGGC GAACTGCGCC TGGGCGATTG GGACGCCCTG CTGCGGGTGG CCCGGGTCGC CTCACTGGAG GCACGCCTGC ACGCGCTGTT ACAGGAGCGG GATCTGTTCG ACCGGGTGCC CGCCCGACCG CGGCGCCACC TGGAGGCGGC GGGCCGGGTA GCCGCCGAAC AACACCACCG GATGCGCTGG GAGGTGGAAC AGGTCCATGA GGCGCTGGCC GCGCGCAAGG CGCCGGTGGT GATTCTCAAG GGGGCGGCCT ACCTGATGGC CGGCCTGCCC AGCGCCCGCG GCCGGCTGTT CGCGGACCTG GACATCATGG TCCCACGCCC GGCACTGGCG ACCACCGAAC ACGTCCTGTT CACCCGCGGC TGGCTGGCGC AGGGCCACGA CGAATACGAC CAGACCTACT ACCGGCGCTG GATGCATGAA CTGCCCCCGC TGACCCACAT CCGGCGAAAG AGCGTGCTGG ACGTCCACCA CACCGTCCTG CCGCCGACGG CGAGATTGCA CCCGGACCCG GACAAGCTAT TCGCCGCCGC CACGCCGCTC CCGGGCTGGC AAAACCTCTA TGTGCTGGCC CCCACCGACA TGGTGCTGCA CAGCGCGACC CACCTCTTCC ACGACGGCGA GTTGGAGAAC GGCCTGCGGG ATCTGGTCGA TCTGGATGAC CTGGTCCGCC ATTTCCACCG CCACGTCGAC GGCTTCTGGC CCGCGCTGGT GGACCGGGCC CATGAGATGG ACCTGGCGCG CCCGCTGTTC TACGGCCTGC GGTATGCGGC CCACTTTCTG AACACCCCGG TGCCGGCGAC CGTCAACGAG GGGCTCGCCG CAGCCGGCCC GGGCGTGCCA CTGCGCCCGC TCATGGACGG GCTGTTCCGG CGCGGCCTCG CGCCCCACCA TTGGCAATGC GACGATTGGC TTTCCCCGAC CTGCCGATGG ATACTTTACG TAAGGTCGCA CTACCTGCGC ATGCCCTTGC GCCTGCTGGT ACCCCACCTC ACCCGCAAGG CCATCAAGAG ACGAATGGCG GCCCCCGAGG CCCACGCCTA G
|
Protein sequence | MMPRPLEHLL TALTDPASVG ELRLGDWDAL LRVARVASLE ARLHALLQER DLFDRVPARP RRHLEAAGRV AAEQHHRMRW EVEQVHEALA ARKAPVVILK GAAYLMAGLP SARGRLFADL DIMVPRPALA TTEHVLFTRG WLAQGHDEYD QTYYRRWMHE LPPLTHIRRK SVLDVHHTVL PPTARLHPDP DKLFAAATPL PGWQNLYVLA PTDMVLHSAT HLFHDGELEN GLRDLVDLDD LVRHFHRHVD GFWPALVDRA HEMDLARPLF YGLRYAAHFL NTPVPATVNE GLAAAGPGVP LRPLMDGLFR RGLAPHHWQC DDWLSPTCRW ILYVRSHYLR MPLRLLVPHL TRKAIKRRMA APEAHA
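
The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a record might be processed, the snippet below strips the spacing, computes a GC fraction, and translates codons. It assumes the gene sequence is already given in the coding (sense) orientation, which is consistent with it starting with ATG despite the "-" strand annotation. The helper names are hypothetical. The codon table built here is the standard one; translation table 11 uses the same amino-acid assignments and differs only in its permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed in the record.
prefix = "ATGATGCCGC GCCCGCTGGA GCACCTGCTG"
print(translate(prefix))  # MMPRPLEHLL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to the same functions would allow the whole protein and the reported GC content to be compared against the record.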