Gene Information |
Locus tag | LEUM_1066 |
Symbol | |
ID | 4422828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 |
Kingdom | Bacteria |
Replicon accession | NC_008531 |
Strand | + |
Start bp | 1066085 |
End bp | 1067323 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 639674765 |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_818539 |
Protein GI | 116618168 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
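The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the database. The GC helper is included only to show how a figure like the 38% above could be recomputed from the gene sequence.

```python
# Values copied from the record above.
START_BP = 1066085
END_BP = 1067323
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with "Gene Length"
    print(protein_length(GENE_LENGTH_BP))  # compare with "Protein Length"
```

Under these assumptions the span 1066085 to 1067323 gives 1239 bp, and 1239 / 3 = 413 codons, of which one is the stop codon, leaving 412 residues.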
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.980933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTAA AAAGAGATTG GTGGGTAGTA ATCACTACAA TCGCAATAAC TAGTTTTCTT TCAGTTTCAG TATCGGCTTC GGATGTTACT GTTAAAAAAA TTAATAATTT GAATGGTAAT ACTATAAAAG GTGTCGATAT TTCGTCTGTA ATTAGTGAAG AGAAAAGTGG TGTACGTTAT TTCAATGAAG AAGGAAAACA AGAAAATATT TTTCAAATAT TAAAATCAAA TGGTGTTAAT TATATCAGAG TTAGAGTCTG GAATAACCCA TATACAGAGT CTGGTAATGG GTATGGTGGT GGCAATTCAG ATCTAGCGAA AGCAATTTTA ATTGGCAAAC AGGCAAGTAA ATATGGAATG AAGTTGTTGG TGGACTTTCA TTATTCCGAC TTTTGGGCTG ATCCATCTAA ACAGAAGGCA CCCAAATCGT GGCAGAATTT ATCTTATGAT CAGAAACAAA AGGCAGTATA CGCTTACACT CTGGATAGTT TACAAAAAAT AAAACAGGCT GGAATAGATG TAGGAATGGT GCAAATTGGG AATGAAACTA ATAATGGAAT TGCGGGTACT AGCAAATGGC CAGAAATGGC TGGTATATTT AATTCTGGGT CAGCAGCGGT TCGTAACGTA GATAAAAATA TTTTAGTCGC CGTTCATTTC ACAGATATAC AAAAACAAGG CAATGATAAA TGGATTTCAA AGCAGTTGCA TGATCATAAC GTAGACTATG ATGTTTTCGC TACTTCATAT TATCCTTATT GGCATGGTAG CTTAAGTAAT CTGACACAAT CGCTAAGTGA TGTTTCTCGT ACCTATGACA AAAAAGTTAT GGTCGCAGAA ACTTCGTACC CATACACCTA TCAAGATGGA GACGGGTTCA GGAATACAAT TACTAAAGAT TCTAATATTG TATTCGATTA TCCCGTATCT GTCCAAGGTC AAGCGACGGC TTTGCGTGAT GTTTTTCAGG CTGTCGCAAA CGTCGGAAGC ACAGGCTTAG GTGTTTTCTA TTGGGAGCCC GCATGGGTCC CTGTGGGACC AAAGTCAAAT TTAGAAAGCA ATAGGCTATT ATGGGAGAAA TTTGGGTCCG GATGGGCAAC GAAAAGTGCT AGCGAATTTG ACAAAGATGC TGAGGGAAAT GCAGGGGGAT CATCTTGGGA TAATCAGGCT TTGTTCAATT TTAAAGGTAA AGCCCTACCT TCGCTAAAGA CGTTTAAATA TATAGACACT GGTCATTAA
|
Protein sequence | MNLKRDWWVV ITTIAITSFL SVSVSASDVT VKKINNLNGN TIKGVDISSV ISEEKSGVRY FNEEGKQENI FQILKSNGVN YIRVRVWNNP YTESGNGYGG GNSDLAKAIL IGKQASKYGM KLLVDFHYSD FWADPSKQKA PKSWQNLSYD QKQKAVYAYT LDSLQKIKQA GIDVGMVQIG NETNNGIAGT SKWPEMAGIF NSGSAAVRNV DKNILVAVHF TDIQKQGNDK WISKQLHDHN VDYDVFATSY YPYWHGSLSN LTQSLSDVSR TYDKKVMVAE TSYPYTYQDG DGFRNTITKD SNIVFDYPVS VQGQATALRD VFQAVANVGS TGLGVFYWEP AWVPVGPKSN LESNRLLWEK FGSGWATKSA SEFDKDAEGN AGGSSWDNQA LFNFKGKALP SLKTFKYIDT GH
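The protein sequence follows from the gene sequence through the translation table listed in the record (table 11). The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments, which table 11 shares with the standard code. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAACTTAA AAAGAGATTG GTGGGTAGTA"))  # MNLKRDWWVV
```

Applied to the first 30 bases of the gene sequence, this yields MNLKRDWWVV, which matches the first block of the protein sequence; the final nine bases (GGTCATTAA) yield GH followed by a stop.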