Gene Caul_3277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3277 
Symbol 
ID5900732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3539579 
End bp3542503 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content66% 
IMG OID641563783 
Productglycoside hydrolase family protein 
Protein accessionYP_001684902 
Protein GI167647239 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGT CTTGGACGAT GACGGCGGCG GCGATCGCCC TGTTCGCGGC CGCCCCCGCG 
TTCGCGGGGG GGCCGTCTCC GCCCACCGGT TTTGACCCTC TGGGCCAACT GCCGGCCCTG
GCGCCCGCGA CTGGCGCGGC GTTCCGCCTC CAGCAGCTGC GCAACGGCGT TCAGTTCCAG
TTGAACGGCG TCACCAAGTC GGTGCTCTTC TATTCGCCGA CGGTCGTGCG GGTGAACGCC
AACCTGGGTC AATCCTACTG GAGCGCGCCA AGCCTCGTGG TGATCCGCCG ACCCGAGAGC
GTTCCTTTCA CCGTGCGCGA GACCCCGGCC GCGGTGGAGC TGGTCGGCCA GAAGCTGCGC
GTCGTCATCG ACAAGCACGC GGGCGCCTTG ACCTATTACG ACGCCGCGGG TCGGCTGCTC
ACCCGCGAGA AGGCCGACGC GCCCCAGTCG ATCGACAAGC GCACCATCGC CGAGGCGCCG
ACCTACGAGG TGGAGAACCG CTTCACGCTC AAGCCGGGAG AAGGCGTTTA CGGCTTTGGC
TTCACCGGCG ACGACGAGGT CAACCGCCGA AACAAGGACT TGCTGCTGGT CCAGACCAAC
GTCGGGATCA TCATTCCGGT GATGGTCTCG ACCGAGCGCT ACGGCGTGCT CTGGGACACC
TATTCGCAGA TGCGCTTCAA AGACGACGAC CAGAGCGCGC GTCTCTGGGC GGAAAGCGCG
CCGGGCGGGG TCGACTACTA CTTCATGGCC GGCGACACGA TGGACGGCGT CGTCGGCGCC
TATCGCTGGC TGACGGGCGA TGCGCCGATG TACCCCAAGC AGGCCTTCGG CTTGTTCATG
AGCAAGGAGC GCTATCCCAC CGAGGAGCGT CTCTTGGAGG TCGCCAGAAC CTTCCGCAAG
GAGAGCTTCC CGCTCGATTA CATCGTCCAG GACTGGCAAT ACTGGGGCGG CGACGAGGAC
GGAACCTGGT CGGGCATGAC CTGGGATGCC AAGCGGTTTC CCGATCCGGT CGGTATGACG
CGGCAGGTCC ACGACCTTCA CATGAAGCTG ATGGTGTCGA TCTGGCCGTC GATTGGCAAC
GACACCGCGC TCGGGCGCGA ACTGGACCAA CACGACCTGC GCTTCAAGCC CCTGCACTGG
ATCTCCAAGC AGGCCCGCGT CTACGACGCC TACAGCCCGC TGGGCCGAGA AATCTATTTC
AAGCACATCA AGTCCGGCCT GCTCGACAAG GGTGTCGACG CGCTATGGAT GGACGGGACC
GAGGTCGAGG TCAGCTCGGC GATGTGGAAT CCGGCCGACA ACATCCGCGA CACCAAGGCG
CTGGGCGCCA ACGCGATGGG TGACTTCACC CGGTACCTGA ACCCGTATTC GCTGCTGACC
ACCGAAGGGA CCTACAAGGG CCAGCGCGCC ACCAGCGACA AGCGGGTCTT CACCCTGACG
CGATCGGCCT GGGCGGGCGC CCAACGCACG GCCGCCGCGT CTTGGTCAGG CGACATCTAC
GCCAGTTGGA AGACCTTCGC CCAGCAGGTG GCGGGAGGCG TCAACGTCAC GATCACCGGC
AACCCCTACT GGACCCAGGA CACCGGCGGC TTCTTCGTCT CCGACTTTCC AGGCGGCGAG
AAGAACCCCG CTTGGCGCGA GCTCTATGCC CGCTGGCTCC AGTACGCGGC CTTCAACCCG
ATCATGCGCA TCCACGGCAC GAGCGTCGAG CGCGAGCCCT ATCTGTTCAA GACCCTGGAT
CCGCCGGTCT ACAAGGCGCT GCTGGACGCC ACCCGGCTTC GGTACCGGCT ACTGCCCTAT
ATCTACAGTC TCAGCGCCAA GGTCACGGCC GATCGCTATA CCCTGATGCG CCCGCTGCCG
ATGGACTTCC CCAAGGATCC GGCGACCTAC AACATCAATG ACAGCTTCAT GTTCGGCTCG
TCACTGCTCG TCCATCCCGT GACGCGCGCG ATGTACAATA TCCTTCCGTC CTCGGCGACG
ACGGTTCCCG GGCGATATCT GCGGACCTCG GACGGCAAGC CCGGACTGAC CGTCCAATAT
TTCGCCGGCG AGAACTTCGA GACGCCGCGC GGTCAGGCGG TCGACACGAA GATCGACCAC
ACCTGGCCCG ATCCTCCGCT GGCGGAGCTT CCGCCCGGCC TGACCTCGCT GAGCCACTTC
TCCGTCCAGT GGGACGGCGA TCTGATCGCG CCGGAGGATG GCGAATACGA AATCGGGCTG
ACAGGCAACG ACGGCATGCG CATGGCGCTG GCCGGCGACA CGGCGATCGA CGAGTGGCGG
CGCGCGGCGA CCCGCACGCG CGTGGTGCGG CGAACGCTCA AGGCTGGCGA AAAGCTGCCA
GTCCGGCTGG AATTCTCCCA TCCCGAGGGC GGCCGCGTGT TCCGTTTCGT CTGGCGCACG
CCCAGCGAGC TGAAGCGGGA CGCCGCCGCG ATGAACGCGC CGCGCGACCT CACGATGCGC
ACCTACCTGC CCAAGGGCGC GGATTGGTAC GATTTCTGGA GTGGCCAGCG CCATCCCGGC
GGTCAGATCG TCGCGCGCCA AGCTCCGCTC GACGTCATGC CGCTCTATGT CCGGGCCGGC
GCGATCATCC CGATGGGGCC GGTCCTGCAG TACGCGACCC AGCACCCCGA TGCGCCCTAC
GAGATCCGCG TCTATCCGGG CGCGGACGGC AAGTTCACGA TCTATGAGGA CGACAACGAA
ACCTACGCCT ACGAGAAAGG TCAATCGGCG CGTTACGACC TCAGCTGGAA CGATGCGACG
CGGACCTTGA CGGTGGGACC TCGGCGCGGC GCGTTCGCCG GCATGGTCAA GCAGCGAACG
CTCAAGATCG TGGTGATCGG CGCCGCCGGG GCAAGCACGC TCACGCCGCC ATCGACCGAT
CGTACAGTGT TGTATGGTGG ACGAGCCTTG ACCGTACGGT ACTGA
 
Protein sequence
MKASWTMTAA AIALFAAAPA FAGGPSPPTG FDPLGQLPAL APATGAAFRL QQLRNGVQFQ 
LNGVTKSVLF YSPTVVRVNA NLGQSYWSAP SLVVIRRPES VPFTVRETPA AVELVGQKLR
VVIDKHAGAL TYYDAAGRLL TREKADAPQS IDKRTIAEAP TYEVENRFTL KPGEGVYGFG
FTGDDEVNRR NKDLLLVQTN VGIIIPVMVS TERYGVLWDT YSQMRFKDDD QSARLWAESA
PGGVDYYFMA GDTMDGVVGA YRWLTGDAPM YPKQAFGLFM SKERYPTEER LLEVARTFRK
ESFPLDYIVQ DWQYWGGDED GTWSGMTWDA KRFPDPVGMT RQVHDLHMKL MVSIWPSIGN
DTALGRELDQ HDLRFKPLHW ISKQARVYDA YSPLGREIYF KHIKSGLLDK GVDALWMDGT
EVEVSSAMWN PADNIRDTKA LGANAMGDFT RYLNPYSLLT TEGTYKGQRA TSDKRVFTLT
RSAWAGAQRT AAASWSGDIY ASWKTFAQQV AGGVNVTITG NPYWTQDTGG FFVSDFPGGE
KNPAWRELYA RWLQYAAFNP IMRIHGTSVE REPYLFKTLD PPVYKALLDA TRLRYRLLPY
IYSLSAKVTA DRYTLMRPLP MDFPKDPATY NINDSFMFGS SLLVHPVTRA MYNILPSSAT
TVPGRYLRTS DGKPGLTVQY FAGENFETPR GQAVDTKIDH TWPDPPLAEL PPGLTSLSHF
SVQWDGDLIA PEDGEYEIGL TGNDGMRMAL AGDTAIDEWR RAATRTRVVR RTLKAGEKLP
VRLEFSHPEG GRVFRFVWRT PSELKRDAAA MNAPRDLTMR TYLPKGADWY DFWSGQRHPG
GQIVARQAPL DVMPLYVRAG AIIPMGPVLQ YATQHPDAPY EIRVYPGADG KFTIYEDDNE
TYAYEKGQSA RYDLSWNDAT RTLTVGPRRG AFAGMVKQRT LKIVVIGAAG ASTLTPPSTD
RTVLYGGRAL TVRY