Gene LEUM_1066 details

Gene Information

Locus tag: LEUM_1066
Symbol:
ID: 4422828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293
Kingdom: Bacteria
Replicon accession: NC_008531
Strand:
Start bp: 1066085
End bp: 1067323
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 38%
IMG OID: 639674765
Product: arabinogalactan endo-1,4-beta-galactosidase
Protein accession: YP_818539
Protein GI: 116618168
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3867] Arabinogalactan endo-1,4-beta-galactosidase
TIGRFAM ID:
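
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates and that the protein length excludes the stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above; the names are hypothetical.
start_bp = 1066085
end_bp = 1067323


def gene_length(start, end):
    """Feature length, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1239
print(protein_length(length_bp))  # 412
```

Both results agree with the Gene Length and Protein Length fields listed above.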


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.980933
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTTAA AAAGAGATTG GTGGGTAGTA ATCACTACAA TCGCAATAAC TAGTTTTCTT 
TCAGTTTCAG TATCGGCTTC GGATGTTACT GTTAAAAAAA TTAATAATTT GAATGGTAAT
ACTATAAAAG GTGTCGATAT TTCGTCTGTA ATTAGTGAAG AGAAAAGTGG TGTACGTTAT
TTCAATGAAG AAGGAAAACA AGAAAATATT TTTCAAATAT TAAAATCAAA TGGTGTTAAT
TATATCAGAG TTAGAGTCTG GAATAACCCA TATACAGAGT CTGGTAATGG GTATGGTGGT
GGCAATTCAG ATCTAGCGAA AGCAATTTTA ATTGGCAAAC AGGCAAGTAA ATATGGAATG
AAGTTGTTGG TGGACTTTCA TTATTCCGAC TTTTGGGCTG ATCCATCTAA ACAGAAGGCA
CCCAAATCGT GGCAGAATTT ATCTTATGAT CAGAAACAAA AGGCAGTATA CGCTTACACT
CTGGATAGTT TACAAAAAAT AAAACAGGCT GGAATAGATG TAGGAATGGT GCAAATTGGG
AATGAAACTA ATAATGGAAT TGCGGGTACT AGCAAATGGC CAGAAATGGC TGGTATATTT
AATTCTGGGT CAGCAGCGGT TCGTAACGTA GATAAAAATA TTTTAGTCGC CGTTCATTTC
ACAGATATAC AAAAACAAGG CAATGATAAA TGGATTTCAA AGCAGTTGCA TGATCATAAC
GTAGACTATG ATGTTTTCGC TACTTCATAT TATCCTTATT GGCATGGTAG CTTAAGTAAT
CTGACACAAT CGCTAAGTGA TGTTTCTCGT ACCTATGACA AAAAAGTTAT GGTCGCAGAA
ACTTCGTACC CATACACCTA TCAAGATGGA GACGGGTTCA GGAATACAAT TACTAAAGAT
TCTAATATTG TATTCGATTA TCCCGTATCT GTCCAAGGTC AAGCGACGGC TTTGCGTGAT
GTTTTTCAGG CTGTCGCAAA CGTCGGAAGC ACAGGCTTAG GTGTTTTCTA TTGGGAGCCC
GCATGGGTCC CTGTGGGACC AAAGTCAAAT TTAGAAAGCA ATAGGCTATT ATGGGAGAAA
TTTGGGTCCG GATGGGCAAC GAAAAGTGCT AGCGAATTTG ACAAAGATGC TGAGGGAAAT
GCAGGGGGAT CATCTTGGGA TAATCAGGCT TTGTTCAATT TTAAAGGTAA AGCCCTACCT
TCGCTAAAGA CGTTTAAATA TATAGACACT GGTCATTAA
 
Protein sequence
MNLKRDWWVV ITTIAITSFL SVSVSASDVT VKKINNLNGN TIKGVDISSV ISEEKSGVRY 
FNEEGKQENI FQILKSNGVN YIRVRVWNNP YTESGNGYGG GNSDLAKAIL IGKQASKYGM
KLLVDFHYSD FWADPSKQKA PKSWQNLSYD QKQKAVYAYT LDSLQKIKQA GIDVGMVQIG
NETNNGIAGT SKWPEMAGIF NSGSAAVRNV DKNILVAVHF TDIQKQGNDK WISKQLHDHN
VDYDVFATSY YPYWHGSLSN LTQSLSDVSR TYDKKVMVAE TSYPYTYQDG DGFRNTITKD
SNIVFDYPVS VQGQATALRD VFQAVANVGS TGLGVFYWEP AWVPVGPKSN LESNRLLWEK
FGSGWATKSA SEFDKDAEGN AGGSSWDNQA LFNFKGKALP SLKTFKYIDT GH
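
The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the usual TCAG ordering; the last base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(dna.count(base) for base in "GC") / len(dna)


# First three blocks and final nine bases of the gene sequence above.
print(translate("ATGAACTTAA AAAGAGATTG GTGGGTAGTA"))  # MNLKRDWWVV
print(translate("GGTCATTAA"))                         # GH
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.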