Gene Caul_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3122 
Symbol 
ID5900577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3386385 
End bp3387410 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content69% 
IMG OID641563625 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001684747 
Protein GI167647084 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.435185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.194122 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAGCA TCTCCGCCGC CATCCTCGGC TGCGCCGGGA CCACCCTGAC GGCGGAAGAG 
GCCGCGTTCT TCCGGGACGT GAAGCCGTGG GGCTTTATCC TGTTCAAGCG CAACATCGCC
GATCCCAATC AGGTCCGGGC CCTGACGGCG GCGTTGCGCG AGACAGTGGG GCGCCCCGAC
GCGCCGATCC TGATCGACCA GGAGGGCGGC CGTGTCGCCC GCCTGCAGCC GCCGCACTGG
AAGACCTATC CGCCCGGCCG AGCCTATGGC GAACTGGTGG CCAACGACCC GTTGGCGGCC
CGCGAGATCA CCCGCCTGGG CGCGCGGCTG ATCGCCCACG ACCTGCTGGC GCTGGGGATC
AATGTCGACT GCGTGCCGGT GCTGGACGTG CCCGATCCGC AGGGGCACGA GATCATCGGC
GACCGCGCCT ATGGCGACAC GCCTGAGCAG GTGGCCACCC TGGGCCGCGC GGCGGCCGAG
GGTCTGCTGG CCGGCGGCGT CCTGCCAATC ATCAAGCACA TCCCCGGCCA TGGCCGCGCC
ATGAGCGACA GCCACCTGGA GCTGCCGGTC GTGAAGGCCA AGCTGGCCGA ACTGGACGCC
CGGGACTTCG CGCCGTTCCG CGTGTTGTCC GACATGCCCA TGGCGATGAC CGCCCACGTC
GTTTACACGG CCATCGACCG CCGCAATCCG GCGACGACGT CGCGCAAGGC GATCAAGAAA
ATCATCCGCG AATCCATCGG CTTCGACGGA CTTCTGATGA GCGACGACCT GTCGATGAAG
GCGCTGTCGG GCGACTTCAA GCAGCGCGCC AAGGCCAGTC TGTCGGCCGG CTGCGACGTC
GTTCTGCACT GCAACGGCGA CATGGCCGAG ATGAAGGCGG TGATGTCGGG CGTCGGCAAG
CTGTCGCGCG AGGCCAAGCG CCGGGTGCAG GCGGTCATGG GGCGGCTGGT CAAGGTTCCC
GAGCCGCTGG ACGTGGCCGA GGCTCGCGCC CGCTTCGACG CGGCCTTCAA CGGCGAATTT
GCGTGA
 
Protein sequence
MASISAAILG CAGTTLTAEE AAFFRDVKPW GFILFKRNIA DPNQVRALTA ALRETVGRPD 
APILIDQEGG RVARLQPPHW KTYPPGRAYG ELVANDPLAA REITRLGARL IAHDLLALGI
NVDCVPVLDV PDPQGHEIIG DRAYGDTPEQ VATLGRAAAE GLLAGGVLPI IKHIPGHGRA
MSDSHLELPV VKAKLAELDA RDFAPFRVLS DMPMAMTAHV VYTAIDRRNP ATTSRKAIKK
IIRESIGFDG LLMSDDLSMK ALSGDFKQRA KASLSAGCDV VLHCNGDMAE MKAVMSGVGK
LSREAKRRVQ AVMGRLVKVP EPLDVAEARA RFDAAFNGEF A