Gene Caul_1291 details


Gene Information

Locus tag: Caul_1291
Symbol:
ID: 5898746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1360330
End bp: 1361994
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 66%
IMG OID: 641561776
Product: glycoside hydrolase family protein
Protein accession: YP_001682919
Protein GI: 167645256
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
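
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name cds_lengths is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(1360330, 1361994))  # (1665, 554)
```

Under those assumptions the result agrees with the 1665 bp and 554 aa reported in the record.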


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.186796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0579695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGGAACC AAGATAGAGT TCGACCGGGA CGGATCCGGT CCTTGGCGAT GATCGCGGCG 
CTGCTCGCCG CCACCGCCTC GGCCGGCATG GGGGCCTCCA CTCCAGCCGC CGCCGCCGAC
CTGCCCAAGT TCGTCGCCAA GGACGGTCGC CACGCCCTGA TGGTCGATGG GGCGCCGTTC
CTGATGCTGG GCGTGCAGGT CAACAATTCC AGCAACTACC CTTCGCAACT GCCCAAGGTC
TGGCCGGCGG TGAAGGCGCT GCAGGCCAAC ACCGTCGAGG TCCCGATCGC CTGGGAGCAG
ATCGAGCCGG TCGAGGGCAG GTTCGACTTC TCGTTCCTCG ACGTGCTGCT CAAGCAGGCC
CGCGAGAACG ACGTCAAGCT GGTGTTGCTG TGGTTCGGGA CGTGGAAGAA CAACGCCCCC
AACTACGCGC CCGAGTGGGT CAAGCTGGAC AATACGCGCT TTCCGCGGGT GGTCACCGCC
AAGGGCGAGA CCCGCAACTC GCTGTCGCCG CACTTCCCCG CCACCCTGGA GGCCGACAAG
AAGGCCTTCG TGCAGTTGAT GCGCCACCTG AAGGCCGCCG ATCCGGACCA CACGGTGATC
CTGGTCCAGC CCGAGAACGA GACGGGCGTC TACAGCGCAG TTCGCGACTA TTCCCCCGCC
GCCCAGAAAC TGTTCGAGGG TCCTGTTCCG GCCGAACTGG TCAAGGCGAT GGGCAAGACG
CCAGGAACCT GGAGCCAGGT GTTCGGCAAG GACGCCGACG AATATTTCCA CGCCTGGTCG
ATCGGCCGCT ACGTCGATCA GATCGCCGCC GCCGGCAAGC GCGAGCTGGC GCTGCCGATG
TATGTCAACG CCGCCCTGCG CGATCCCTTC AAGGACCAGG ACCCCTACAC CTACTCGTCG
GGCGGACCGA CCTGGAACGT GCTCGACGTC TGGAAGGCGG CGGCGCCGTC GATCGACGCC
ATCGCGCCGG ACATCTACAT GCGCGAGAGC AGCAATGTCC GAAAGACGCT GGCGCAGTAC
GGCCGGCCCG ACAATCCGCT GTTCGTGCCG GAGATCGGCG ACGACAAGGG GTTCGCGCGC
TACTTCTACG ATGTGCTGGG CGCCCACGGC CTGGGCTTCT CACCGTTCGG CCTGGATCAG
ACCGGCTATT CCAACTACCC GCTCGGCGCC AAGACGGTCG ACGCCCAGGC GCTGGAGACC
TTCGCGGTTC ACTACCGCCT GCTGGCCCCG ATGGCTCGCC AATGGGCCAG GCTTTCCTAC
GAGGGCAAGG TCTGGGGCGC CGGCGAGGCG GATGACCGCA AGGCCGAGAC CCTGAAGCTG
GGCGATCGCT GGACCGCCAC CCTGTCCTAC GGCGAATGGC AGTTCGGCTC GATCGAGGCC
CCCTGGATGG CCAAGGCCGA AAAACAACCC AATCGCGAGG TCCCCGACGG CGGCGCCCTG
ATCGCCCAGC TGTCACCCAA TGAGTTCCTG ATCACCGGCT ACCGCGCCCG TGTCAGCTTC
GGTTCGGCCA AGGGCGAGCG GATGCTGATG GCGCGGGTCG AGGAAGGCCA TTTCGAGAAC
GGCCAGTGGG TCTTCGATCG CCTGTGGAAT GGCGACCAGA CCGACTACGG CCTGAACCTG
ACGACCTTGC CGCAAGTGCT GAAGGTCAAG CTGGCCACGT ATTAG
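
The record reports a GC content of 66%. One way to check that figure is to count G and C bases over the gene sequence above. This is a minimal sketch; the function name gc_percent is hypothetical, and it assumes the sequence is pasted as a string with the spacing and line breaks shown here, which are stripped before counting.

```python
def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in the record)
    is ignored, and the sequence is treated case-insensitively.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Small illustrative input, not taken from the record.
print(gc_percent("GGCC ATAT"))  # 50.0
```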
 
Protein sequence
MRNQDRVRPG RIRSLAMIAA LLAATASAGM GASTPAAAAD LPKFVAKDGR HALMVDGAPF 
LMLGVQVNNS SNYPSQLPKV WPAVKALQAN TVEVPIAWEQ IEPVEGRFDF SFLDVLLKQA
RENDVKLVLL WFGTWKNNAP NYAPEWVKLD NTRFPRVVTA KGETRNSLSP HFPATLEADK
KAFVQLMRHL KAADPDHTVI LVQPENETGV YSAVRDYSPA AQKLFEGPVP AELVKAMGKT
PGTWSQVFGK DADEYFHAWS IGRYVDQIAA AGKRELALPM YVNAALRDPF KDQDPYTYSS
GGPTWNVLDV WKAAAPSIDA IAPDIYMRES SNVRKTLAQY GRPDNPLFVP EIGDDKGFAR
YFYDVLGAHG LGFSPFGLDQ TGYSNYPLGA KTVDAQALET FAVHYRLLAP MARQWARLSY
EGKVWGAGEA DDRKAETLKL GDRWTATLSY GEWQFGSIEA PWMAKAEKQP NREVPDGGAL
IAQLSPNEFL ITGYRARVSF GSAKGERMLM ARVEEGHFEN GQWVFDRLWN GDQTDYGLNL
TTLPQVLKVK LATY