Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0006 |
Symbol | |
ID | 4269537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 7163 |
End bp | 8950 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638124733 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_740855 |
Protein GI | 114319172 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.780753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGATC TCAACCAGGC GCTAATCGGC AATTGCAGCT TCGCTGCCCT GGTCAACCGG GAGGCCGAGG TTACCTGGGC CTGTATGCCC CGGTTCGACG GTGATCCGGT GTTCTGCAGT CTCCTTGGGG ACCCCGGTGT CGGTGACGGC CACGGCCGTT TCGCCATCCT TCTCGAGGGC CTTGCCCGCA GTGAACAGGC CTACGACCGC AACACCGCCA TCCTCCGCAC CCGCCTCTAC GACCACCAGG GCGGGGTTGT GGAGGTCCTC GATTTTGCAC CCCGATTCAA ACAGTTCGGC CGCAGCTTCC ATCCGGTGAC CCTGGTGCGC CAGGTGCGCT GCCTGGGGGG CGCTCCCCGC GTCCGGATGG TGGTCCGGCC GGTGTTCGAC TACGGCACGA CCCGACCGAT CATCACCCAC GGAAGCAATC ACATCCGCTA CGTCTCGGGC GAGCAGGTGC TGCGCCTTAC TACCAACGCC TCACTGACCG CCATTCTTGA CGAGACGCCG GTGGTGGTGG ATCAGGTCTT TACCTTGATC CTGGGACCGG ACGAGACCCT GCCGCTCTCC CCGCACGAGG CCGCCAGCCA GTTCTACGAG CGCACCACTC ACTACTGGCA GGAGTGGGTC CGCTACCTGG CCGTGCCCTT CGAATGGCAG GAGGCGGTAA TCCGCGCCGC CATCACCCTC AAACTCAATA CCTATGAAGA CACTGGGGCG GTCATCGCCG CGGTGACCAC CTCGTTACCG GAGGCGCCGC ATACCACCCG GAACTGGGAC TACCGCTACT GCTGGCTGCG CGACGCCTAT TTCGTCATCA ATGCCCTCAA TCGCCTGGGT GCCACCAAGA CCCTGGAGGA CTACGCCCGC TTCATCATCA ACACCACCGC GTCCAACGGT CAGCGGCCCC TGCGGCCGGT GTATCGGATC AACGGCCGGG ACGATCTGGA TGAGTACGTC GCCGAGGGCC TACCAGGTTA CCGCGGCATG GGCCCGGTCC GGGTGGGGAA CCAGGCCTAT GAGCAGCAGC AGAACGACGT CTACGGCGCG GTGATCCTTG CCACCGCCCA CCTCTTCTTC GACCGCAGGC TCCGCCGCAT GGGCGATGAA CCCCTGTTCC GGCGGCTGGA GGTTCTGGGG GAGCAGGCGG TGGCGGTCCA CGAGCAGCCG GATGCCGGGC TCTGGGAGTT TCGCGGCATG GCCGGGGTGC ATACCTTCTC CGCCACCATG TGCTGGGCCG GGGTCAATGC CCTGGCGGCC ATCGCCCGGG AGCTGAACCT GCCGGACCGG GCCACCTATT GGATCACCCG GGCCCATCGG ATACGCCGGA CCATCCGGGA GCGCGCCTGG AACCCCCGCC GCCAGTCCTA TGTCGCCACC TTCGGAGGCG AGGTCCTGGA CGCCAGCCTG ACGCAGCTCT ATGACCTGGG GTTTCTCAAA CCGGGGGATC CCCGCCTTAC CCAGACAGTG GTGGCCCTTG AAGGCGAACT GCGTCACGGT GATTTTCTTT TCCGCTATAT TCATGAAGAC GACTTCGGAA AACCGACCAC GGCGTTCACC ACCTGCACCT TCTGGTATGT CGACGCCCTG GCCGCCACCG GCCGGCGGGA GGAGGCGAGG GCGCTGTTCG AGAAGCTGCT GGCCTGCCGT AATCACGTGG GCCTGCTTTC CGAGGATATC GACCCCTACA CGGGGGAGCT TTGGGGGAAC TTCCCCCAGA CCTATAGCAT GGTGGGTCTG ATCAATTCGG CGATGCGACT GAGCAAACCG TGGGAGCAGG CCTACTGA
|
Protein sequence | MTDLNQALIG NCSFAALVNR EAEVTWACMP RFDGDPVFCS LLGDPGVGDG HGRFAILLEG LARSEQAYDR NTAILRTRLY DHQGGVVEVL DFAPRFKQFG RSFHPVTLVR QVRCLGGAPR VRMVVRPVFD YGTTRPIITH GSNHIRYVSG EQVLRLTTNA SLTAILDETP VVVDQVFTLI LGPDETLPLS PHEAASQFYE RTTHYWQEWV RYLAVPFEWQ EAVIRAAITL KLNTYEDTGA VIAAVTTSLP EAPHTTRNWD YRYCWLRDAY FVINALNRLG ATKTLEDYAR FIINTTASNG QRPLRPVYRI NGRDDLDEYV AEGLPGYRGM GPVRVGNQAY EQQQNDVYGA VILATAHLFF DRRLRRMGDE PLFRRLEVLG EQAVAVHEQP DAGLWEFRGM AGVHTFSATM CWAGVNALAA IARELNLPDR ATYWITRAHR IRRTIRERAW NPRRQSYVAT FGGEVLDASL TQLYDLGFLK PGDPRLTQTV VALEGELRHG DFLFRYIHED DFGKPTTAFT TCTFWYVDAL AATGRREEAR ALFEKLLACR NHVGLLSEDI DPYTGELWGN FPQTYSMVGL INSAMRLSKP WEQAY
|
| |