Gene Caul_4792 details


Gene Information

Locus tag: Caul_4792
Symbol:
ID: 5902254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5180557
End bp: 5182383
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 63%
IMG OID: 641565312
Product: glycoside hydrolase family protein
Protein accession: YP_001686410
Protein GI: 167648747
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2273] Beta-glucanase/Beta-glucan synthetase
TIGRFAM ID:
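
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported gene length includes the terminal stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5180557
end_bp = 5182383
gene_length_bp = 1827
protein_length_aa = 608

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence includes one stop codon that is not part
# of the protein, so the number of codons is residues + 1.
expected_cds_bp = (protein_length_aa + 1) * 3

print("coordinate span matches gene length:", span_bp == gene_length_bp)
print("protein length matches gene length:", expected_cds_bp == gene_length_bp)
```

Under these assumptions both comparisons hold: the span is 1827 bp, and 609 codons (608 residues plus a stop) also give 1827 bp.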


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGC AGCTTTTGAA CATGAACGCG GACTCGGTTA TACTGGCCAC GAGCATGGCG 
AGCCGAACCG TCTGGTATGG GTCGGGTAAC CGCGAAATCA TCACCGGCGG CGATCAGAGC
GATGGATTAC GCGGCAATGG CGGCGGCGAC ACCCTGGTCG GCGGCAAGGG CGACGACACC
TATTACGCCC ACCCGGACGA CATTGTCGTC GAGAAGGCTA ACGGCGGCGT CGACACGATC
TCGTCCCTCG ACAATTTCAC CCTGCCCGAC CACGTCGAAA ACCTGACGAT CACCGGGTTG
AACAAGACCG GGGTCGGCAA CAGCCTGTCC AATATTCTGT CCGGCGGGAA CGGAACCCAG
ACCCTGAACG GCAAGGGCGG TCTGGACGTG CTGATCGGCG GCGCCGACGC CGACATCTTC
GTGATGGAAA GGGGCAATTC GCGCGACGTC ATCAACGACT TCGCGGTCGG GATCGATCAT
GCCCGGTTGT CGGGCTTCAC GTTCACCAGC TTCGACCAGG TCCGCGCCAA GATGAAGCAG
GCCGGTTCGG ACGTCGTGCT GGACCTCGGC GGCGGCGACC AGTTGACATT CCGCGAGCAC
AAGATCGCCG ACTTCTCGGC CGCCGACTTC CAGCTGGGCC TGGACAAGAG CGGCATGAAG
CTGACCTTCA GCGATGAGTT CAACGGGTTG AGCCTGTGGA ACGGCAAGAC CGGCACCTGG
CGGACCGAAT TCGGCTATGG CGGAGAGGGC AGCCTGGCCA GCCGCGACGG TATCCGCGCG
ATCTACGTGG ACGCCCAATT CAAGGGCACC GGCAAGAAGG CGCTCGGCGT CGATCCGTTC
GATATCGACA ACGGCGTGCT CAGCATCACC GCCGCGCCAG CCACCGCCGC CGTCAAGGCG
CAGATCTGGA ACCACACCTA TACTTCAGGC GTCCTGACGA CGAAGTTCTC GTTCGCCCAA
GAGTACGGCT ACTTCGAGAT CAAGGCCAAG CTACCGGCGG GCCAGGGCTT CTTTCCGGCC
TTCTGGCTGT TGCCGACCGA CGGCTCCTGG CCGCCGGAAA TCGACATCTT CGAGAGCCTC
GGCAAGGATC CCGACACCAT CTACACCACC AGTCACAGCA ACAGCAGCGG CAAGATGGTC
AGCGACGCGT CGTCGGTCCG GATCGAGGGC GCGGCCACGT CGTTCCACAC CTATGGCCTG
GATTGGAGCG AAGACTATCT CGTCTGGTAC GTCGATGGCG TCGAGGTGGA GCGTCAGCAG
ACGCCGGACG ACATGCACAA GCCGATGTAC ATGCTGATCA ACCTGGCGGT CGGCGGCGGC
TGGGCCGGCG AGCCGACGGC GGCCACTGGC TCCGGCCAGT TGGAAATCGA CTATGTCCGC
ACCTACGCGC ACGATGCATC CCCGCCACCA CCACCACCAC CGCCTCCTCC TCCTCCTCCT
CCTCCTCCTC CTCCTCCTCC TCCTCCTCCT CCGCCTCCTC CTCCGCCGCC GCCGGTCGAA
ACCTCGCCGA AGATCGTTCT GGCGGGCACC ACGGCCAATG ACAGCTTCAT TTTGAAGGCG
GCGATGTTCG ACACCGACGC GACAGGGGTG CAGGCGAGCA TCAAGTCATT CGGCGGGGCC
AGCGGGTGGT CGTCGAACAA CAACGACTTC GTCAGCCTGA CCGGCTGGTC GAAGGGCTCC
AGCTTCACCT GGGACCACGA CGACGCCAAC GACGCCCATG TCGGTTTCTA TCGGATCCAC
GACGCGTCTC TGAACAAGGA CGTCATGATC CAGATCACCA CCGTCACCGG CCAACATGTC
AGCGCGGGGG ACTTCAACTT CTACTGA
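
The gene sequence is printed in groups of ten bases, so it has to be stripped of whitespace before any computation such as GC content. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence; paste the full block to check the whole gene.
first_row = "ATGGCCAAGC AGCTTTTGAA CATGAACGCG GACTCGGTTA TACTGGCCAC GAGCATGGCG "
seq = clean_sequence(first_row)
print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 2))
```

Applying `clean_sequence` to the complete block should yield a string whose length can be compared with the reported gene length.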
 
Protein sequence
MAKQLLNMNA DSVILATSMA SRTVWYGSGN REIITGGDQS DGLRGNGGGD TLVGGKGDDT 
YYAHPDDIVV EKANGGVDTI SSLDNFTLPD HVENLTITGL NKTGVGNSLS NILSGGNGTQ
TLNGKGGLDV LIGGADADIF VMERGNSRDV INDFAVGIDH ARLSGFTFTS FDQVRAKMKQ
AGSDVVLDLG GGDQLTFREH KIADFSAADF QLGLDKSGMK LTFSDEFNGL SLWNGKTGTW
RTEFGYGGEG SLASRDGIRA IYVDAQFKGT GKKALGVDPF DIDNGVLSIT AAPATAAVKA
QIWNHTYTSG VLTTKFSFAQ EYGYFEIKAK LPAGQGFFPA FWLLPTDGSW PPEIDIFESL
GKDPDTIYTT SHSNSSGKMV SDASSVRIEG AATSFHTYGL DWSEDYLVWY VDGVEVERQQ
TPDDMHKPMY MLINLAVGGG WAGEPTAATG SGQLEIDYVR TYAHDASPPP PPPPPPPPPP
PPPPPPPPPP PPPPPPPPVE TSPKIVLAGT TANDSFILKA AMFDTDATGV QASIKSFGGA
SGWSSNNNDF VSLTGWSKGS SFTWDHDDAN DAHVGFYRIH DASLNKDVMI QITTVTGQHV
SAGDFNFY
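
The protein sequence can be related back to the gene sequence by codon translation. The record lists translation table 11, whose amino acid assignments are the same as those of the standard genetic code (the tables differ in which codons may act as start codons). The following is a minimal sketch under that basis; the `translate` helper is hypothetical, and it does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base grouping
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGGCCAAGC AGCTTTTGAA CATGAACGCG"))  # MAKQLLNMNA
print(translate("TTCTACTGA"))                         # FY
```

The two outputs agree with the first ten and last two residues of the protein sequence, and the final codon TGA is a stop.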