Gene Acid345_1990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1990 
Symbol 
ID4070896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2384116 
End bp2385234 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content58% 
IMG OID637984004 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_591065 
Protein GI94969017 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.639682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCTA CTCGGCTCCA GGAACTCCAG CAGCAAGTCG GCCAGCTAAT GATCGTCGGC 
TTCGACGGCA CCGAGATGTC TGCGCGCGTA CGCACGCTTC TCGCTACGAT CCAGCCGGCG
GGTACCATCT TCTTCAAGCG CAACGTAGCG ACTGCCGAGC AGACATGGAA GCTCAACTAC
GAGGCGCAGG CGGCCGTTTC CACACCGCTC TTCCGTTGCG TTGACCTCGA AGGCGGCACC
GTCGATCGCC TTCGAGACGC AGTCGCTCCT GCGCCGTCGC TCTCTAATGT GGCGGCGACC
GGATCAAAAA AAGTCATGCG TCGCTTTGCT CGGACGCTCG CAGCAGAAGC TCGCGCTCTC
GGATTCAACA CTGACTTCGC TCCGGTCTTC GACCTGCGCA CAGTCGAATC AGTCAAGGTT
CTCGCCGGCC GAACGATCGC AGCCGATCCC AAGCACATCA TCGAACTCGC CAGGGAGTTC
TTGAAGGGCT TCAAAGACGA AAACGTTCTC GGTTGCGGCA AGCATTTTCC CGGCCTCGGC
GCGGGTGCTG TCGATTCCCA CTACGAGCTG CCAACCATTA GCAAGCCCTG GAAGGCGTTA
TGGGAAGAGG ACCTGCTTCC CTATCGCAAG CTTAAAGACG AGATCGCCTT TGCGATGGTC
GCGCACTGCG TTTACCCGAA CGCTACGAAA GAAAAGGCCC CCGCTTCCAT CTCCCGTTTC
TGGATGACAG ACATCCTGCG CAAGAAGATC GGATTTAAGC ACCTCATCTG TTCCGACGAC
ATGGAGATGA AAGGTGTTCA AAAAGCGGTT TCGATCGAAG AAGCCTGCAT CCAGGCAGTC
CGCGGCGGCG CCGATCTTTT TCTCGTCTGC AACAACGAAT CGCTTGTGTG GCGTTGCTTT
CACGCTGTGC TGCGTGAAGC CGAACGCGAC AAATCCTTTG CGAAACAAAT CGCAGCCGCG
TCTCGCCGCG TGTTCGAGTT CAAGAAACGT TCGCGGGCCG TGCGAGCCAA GTTCAACCCT
GCGCCGACTC TCCGCACCGT AGACAAGCTT CGTCGCACGA TCTGGGAACT CACCGAAGAA
GTTCGCTACA GCAGCCCCAA TCCGGAGCGG GCCCTTTGA
 
Protein sequence
MASTRLQELQ QQVGQLMIVG FDGTEMSARV RTLLATIQPA GTIFFKRNVA TAEQTWKLNY 
EAQAAVSTPL FRCVDLEGGT VDRLRDAVAP APSLSNVAAT GSKKVMRRFA RTLAAEARAL
GFNTDFAPVF DLRTVESVKV LAGRTIAADP KHIIELAREF LKGFKDENVL GCGKHFPGLG
AGAVDSHYEL PTISKPWKAL WEEDLLPYRK LKDEIAFAMV AHCVYPNATK EKAPASISRF
WMTDILRKKI GFKHLICSDD MEMKGVQKAV SIEEACIQAV RGGADLFLVC NNESLVWRCF
HAVLREAERD KSFAKQIAAA SRRVFEFKKR SRAVRAKFNP APTLRTVDKL RRTIWELTEE
VRYSSPNPER AL