Gene Lcho_0593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0593 
Symbol 
ID6159702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp643161 
End bp644264 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content72% 
IMG OID641663343 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001789633 
Protein GI171057284 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0144455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACT CCCCCGTCGT TCTGGACATC GCCGGCACCA CGCTCGACGC CGACGACCGC 
CGTCGCCTGC AGCACCCGCT CGCCGGCGGG CTGATCCTGT TTGCGCGCAA CTGGGTCGAC
CGCCGCCAGC TGGTCACGCT GATCGCCGAG ATCAAGGACC TGCGGCCCGA CCTGCTGATC
TGCGTCGACC ACGAAGGCGG GCGGGTGCAA CGCTTCAAGA CCGACGGGTT CACCCACCTG
CCGCCGATGC GCGCGCTCGG CGAACGCTGG ATGCGCGACG AGCGCGGCCA GCCCGGCAGC
GGCGCGATGC GTGCCTGCGA GGCCGCCACC GCCACCGGCT ACGTGCTGGC GGCGGAGCTG
CGCGCCTGTG GGGTCGACCT CAGCTTCACG CCGGTGCTCG ACCTCGAACA CGCGCACAGC
AACGTGATCG GCGACCGCGC GCTGCACCGC GACGCCCGCG TCGCCACGCT GCTGGCCAAG
AGCCTGATGC ACGGCCTGCT GCAGGCCGGC ATGGGCAACT GCGGCAAACA TTTCCCCGGC
CACGGCTGGG CCCGGGCCGA CAGCCACGTC GCCATCCCGC GCGACACCCG CTCGCTCAAG
GCCATCCTGG CCGACGACGC CCTGCCCTAC GCCTGGCTGT CGAGCAGCCT GACGGCGGTG
ATGCCGGCGC ACGTGATCTA CCCGAAGGTC GATGCGCGCC CGGCCGGCTT CTCGGCGCGC
TGGCTGCAGG AGATCCTGCG TGACCAGTTC GGCTTCACCG GCGCCGTCTT CAGCGACGAC
CTCAGCATGG CGGCGGCGCG CTCGGTGCCC GACGTGCAGG GCGGCGCCGA GCTGAGCTAC
AGCCAGGCCG CGCTGCTCGC GCTGGAGGCG GGCTGCGACA TGGTGCTGCT GTGCAACCAG
TCGCTCGGCG ACGGCGGCCG GGCCGTCGAT GAACTGCTCG ACGGCCTGGG CGACGCCATC
GAACAAGGCC GATGGAGACC CGACCCGGAC AGCGAAACCC GCCGCATCGC CCTGCTGCCG
CAGACCCCGC CGCTGCCGTG GGACGAGCTG ATGCACCACG CGCCCTACCA GCGCGCGCTG
GAACTGATCG GCGAGCCGGG CTGA
 
Protein sequence
MNHSPVVLDI AGTTLDADDR RRLQHPLAGG LILFARNWVD RRQLVTLIAE IKDLRPDLLI 
CVDHEGGRVQ RFKTDGFTHL PPMRALGERW MRDERGQPGS GAMRACEAAT ATGYVLAAEL
RACGVDLSFT PVLDLEHAHS NVIGDRALHR DARVATLLAK SLMHGLLQAG MGNCGKHFPG
HGWARADSHV AIPRDTRSLK AILADDALPY AWLSSSLTAV MPAHVIYPKV DARPAGFSAR
WLQEILRDQF GFTGAVFSDD LSMAAARSVP DVQGGAELSY SQAALLALEA GCDMVLLCNQ
SLGDGGRAVD ELLDGLGDAI EQGRWRPDPD SETRRIALLP QTPPLPWDEL MHHAPYQRAL
ELIGEPG