Gene Caul_0293 details


Gene Information

Locus tag: Caul_0293
Symbol:
ID: 5897567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 324940
End bp: 327285
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 70%
IMG OID: 641560777
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_001681928
Protein GI: 167644265
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
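
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about this table's conventions, not something the record states.

```python
# Values copied from the Gene Information table.
start_bp = 324940
end_bp = 327285
gene_length_bp = 2346
protein_length_aa = 781

# Assumption: coordinates are 1-based and inclusive on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinate span matches gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under those assumptions all three checks print `True` for this record.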


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.218419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTCG TAGCCAAGCC CTTCGCCAAA CTCGCCGCCC TGGGCGCATC GGTCGCCGTC 
CTGGCCTTCG CCTCGGCCAG CGTCGCCGCG ACGCCGACCC TGACCTTGAC GCCGGCGCCC
GCCCAGGCCG AGATGGGGCA AGGGGTCTTC GCCCTGACCG CCCGGACCCG GATCTTCGTC
GCCAAGGGCG ATGTCGAGGC CAGGGTCGTG GCCAGCCAGC TTTCCGACAT GCTGTTCAAG
GCCCGAGGCC TGAAGCCCGC CGTCGTCGAG GGCGCGCCGC CCGCAGGCGA GGCCGCGATC
GTCCTGGTCC GGACCCAGGC CGCCCCTGAA GCAGGAATTG GCGACACGGC CGAGGCCTAC
CTCCTCGACG TCGCCCCGAC CGGCGTCACC ATCACCGCTC CCAAGCGCGC CGGTCTGTTC
TACGGCGCGG TCAGCGTCTG GCAACTGGCC GTGCAGGACG CCGCCAAGGG TCCCGCGGAC
CTGCCGGCGG TCAGCATCGT CGACGCCCCG CGTTTCGCCT GGCGCGGCTT CATGCTCGAC
AGCGCCCGCC ACGTCCAGAG CATCGACACC ATCAAGGCGA TCCTCGACGC CATGGCCGCC
CACAAGCTCA ATGTCCTGCA TTGGCATCTG GTCGATGACC AAGGGTGGCG GCTGGAGATC
AGGAAATATC CGAGGCTGAC GTCCGAAGGG GCCTGGCGCG CGCCGGCAGG GGCGGCGGGC
AAGGACCCCA AGACCGGCAA GCCGATCCGC TACGGCGGCT TCTACACCCA GGACCAGGTG
CGCGACCTAG TCGCCTACGC CGCCGCGCGC GGCGTCACCA TCGTGCCCGA GATCGAGATG
CCGGGCCACG CTCTGGCGCC GCTGGTGGCC TATCCCCAGT TCGGCATGAC GAAGACTCCG
CCGCGCGCCA GCATGGGCGA CTGGGGCGTG TTCCCCTATC TCTACAGGCC CAGCGAAGAG
ACGTTCACGT TCCTCAACGA CGTGCTCGAC GAGGTGATGG ACTTGTTCCC CTCGCCCTAT
ATCCATGTGG GCGGCGACGA GGCGGTCAAG GATCAGTGGA AGGCCAGCCC CGAGGTCCAG
GCCCAGATCC AGGCCTTGGG CGTCAAGGAC GAGCACGGCC TGCAGAGCTG GTTCATCCAG
CGGGCCGAGA AGCACATCAA CGCCCGCGGC CGGCGGATGA TCGGCTGGGA CGAGATCCTT
GAAGGCGGCC TGGCCCCCAA CGCCACGGTG ATGTCCTGGC GCGGCGTCGA CGGCGCGGTC
GCCGCCGCCT CGCAGGGTCA CGACGCCGTC CTGGCCCCAG ACAGCACGCT CTACATGGAC
CGCCGGCAGA GCGCCTCGGC CGACGAGCCG CCTGGCCGGA TCAAGATCAC CAGCCTCAAG
GACGTCTACG CGTTCGACGC CGCCCCGGCC GCGCTCACAC AGGCCCAGCG CGCCCACATC
CTGGGCCTGC AGGCCACGTC GTTCACCGAG CACATGCGCA CCGACGAGCG ACTGGAAAGG
ATGACCTTCC CCCGGCTGGT CGCGGTGGCC GAGAATGGCT GGACGCCTGA GGCCCAGCGC
GACTGGACCC GCTTCGCCGC CCGCCTGCCG GCCGAGACCG CGCGGCTGGA CGCGCTGGGT
GTCGCCCACG ACACCGTGCC CTACGAGCCC CAGGCGACCC TGACCCCGGC GGCGGACGGC
GAGATCTCGG TCGCCCTGGC CTCGGGTCTG GGCCTGGGTG AGATCCGCTA CACCACCGAC
GGCCAGGCGC CGACCAAGAC CTCCGCGCTG TATGATGCGC CTCTCGCGGT CGCGCCCGGC
AAGACGCTCC GTGTCCGGAC GTTCCTCGAA GATGACGCCC TGGGCCGGAT CCGCGACTAC
CCCATCAGCC TGGCGGCGGC CCGGACGCGC AACAGCCATC AGCTCGAGAC CTGCGGCAAC
GGCATCAATC TCTCATTGGA GGACGATGCG CCGGTCACGG GTCCGCGCGC CGTGTTCGCG
GTCGACCTGA TGAACCCCTG CTGGGTGTGG AAGGGCGCGG ACCTGTCCTC GGTCCTGAAG
CTGACGGCTC GCATTGGCCA GGTCCCGTTC AACTTCCAGA TCGGCGCCGA CAAGGCCAGG
ATCCCGTTGC GGGCGCCAGC GACGCCGGAC GGCGAGCTCG AAGTGCGCCT CGACGGGTGC
GCGGGCGAGC GGATCGCCGT CCTCCCGCTA GGCGCCGCGG CGCGCGGACC GGCCATGGGA
ACCGTTTCCG GCGTGGTCCC CGCCAAGGGT GGCGTCCACG ACCTCTGCCT CAGCTTTACA
GCGCGCGGCG TCGAGCCGAC GCTGGTTCTC GACCAGGTCA CGCTGACCCC AACCCACAAG
AACTAG
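
The GC content reported for this gene (70%) is the share of G and C bases in the gene sequence. As a minimal sketch, the helper below strips the spaces and line breaks of the 10-base block layout and counts G and C. The name `gc_fraction` is hypothetical and not part of the source page. It is shown on a short toy string; pasting the full gene sequence into the call would give the value for this gene.

```python
def gc_fraction(blocked_seq: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace in the blocked layout."""
    # Remove spaces and newlines that separate the 10-base blocks.
    bases = "".join(blocked_seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input in the same blocked style (not taken from the gene above).
toy = "ATGC GGCC"
print(gc_fraction(toy))  # 0.75
```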
 
Protein sequence
MTFVAKPFAK LAALGASVAV LAFASASVAA TPTLTLTPAP AQAEMGQGVF ALTARTRIFV 
AKGDVEARVV ASQLSDMLFK ARGLKPAVVE GAPPAGEAAI VLVRTQAAPE AGIGDTAEAY
LLDVAPTGVT ITAPKRAGLF YGAVSVWQLA VQDAAKGPAD LPAVSIVDAP RFAWRGFMLD
SARHVQSIDT IKAILDAMAA HKLNVLHWHL VDDQGWRLEI RKYPRLTSEG AWRAPAGAAG
KDPKTGKPIR YGGFYTQDQV RDLVAYAAAR GVTIVPEIEM PGHALAPLVA YPQFGMTKTP
PRASMGDWGV FPYLYRPSEE TFTFLNDVLD EVMDLFPSPY IHVGGDEAVK DQWKASPEVQ
AQIQALGVKD EHGLQSWFIQ RAEKHINARG RRMIGWDEIL EGGLAPNATV MSWRGVDGAV
AAASQGHDAV LAPDSTLYMD RRQSASADEP PGRIKITSLK DVYAFDAAPA ALTQAQRAHI
LGLQATSFTE HMRTDERLER MTFPRLVAVA ENGWTPEAQR DWTRFAARLP AETARLDALG
VAHDTVPYEP QATLTPAADG EISVALASGL GLGEIRYTTD GQAPTKTSAL YDAPLAVAPG
KTLRVRTFLE DDALGRIRDY PISLAAARTR NSHQLETCGN GINLSLEDDA PVTGPRAVFA
VDLMNPCWVW KGADLSSVLK LTARIGQVPF NFQIGADKAR IPLRAPATPD GELEVRLDGC
AGERIAVLPL GAAARGPAMG TVSGVVPAKG GVHDLCLSFT ARGVEPTLVL DQVTLTPTHK
N
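
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds a codon table from the compact amino-acid string NCBI publishes for table 11 and translates the first 30 bases of the gene, which should reproduce the first 10 residues of the protein. The function name `translate` is hypothetical, and the sketch does not handle the alternative start codons that table 11 allows, since this gene starts with ATG.

```python
from itertools import product

# Amino acids for NCBI translation table 11, listed in the conventional
# TCAG codon order (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_seq: str) -> str:
    """Translate a blocked DNA sequence, stopping at the first stop codon."""
    bases = "".join(blocked_seq.split()).upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_30 = "ATGACTTTCG TAGCCAAGCC CTTCGCCAAA"
print(translate(first_30))  # MTFVAKPFAK
```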