Gene Caul_1778 details


Gene Information

Locus tag: Caul_1778
Symbol:
ID: 5899233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1878025
End bp: 1879821
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 69%
IMG OID: 641562268
Product: glycoside hydrolase 15-related
Protein accession: YP_001683405
Protein GI: 167645742
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
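
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1878025, 1879821)
print(length)                  # 1797
print(protein_length(length))  # 598
```

Under those assumptions both values agree with the Gene Length and Protein Length fields.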


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.743693
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0628922
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAGA ACCTCGACCT CTTCCCGATC GGCAACTGCG CGGTCAGCGC CCTGATCGAC 
CGTGCGGGCC GCTTCGTCTG GGCCTGCGCG CCGCGCATCG ATTCCGACCC GGTGTTCAGC
GCGCTGCTGG GCGGGTTGGA GCCGGGCGAT CCCACCGCCC GGGGCACGTG GGAGGTCGCC
GTCGACGGCG CCAAGACCGT CGAACAGGCG TACCTGCGCA ACACCCCGAT CCTGCGCACG
GTGATCACCG ACGCCGACGG CGCCAGCCTC GAAATTCTCG ATTTCGCCCC GCGCTACCAG
CAGTACGGCC GCAGCTTCCG TCCAACCGCC TTCATCCGCC TGATCCGTCC GCTGACCAGT
GTCGCGCGCA TCACGATCCG CCTGCGCCCG ACCGCCGACT GGGGCGCGCG GGCGGCCGAG
ACCACCCACG GCTCGAACCA CATTCGCTAC CTCTGCTCCG ACATGACCTT GCGCCTGTCG
ACCGACGGGC CGGTGTCCCA CGTGCTGGAG GAGCGCGCCT TCCGACTGGA AAAGCCGATC
GCCATGTTCC TGGGCGCCGA CGAGGGCTTC AACGCCGACA TCGGGGCCAC CTGCAACCGC
ATGCTACAGC AGACCCAGGA ATACTGGATG GACTGGGTGC GGGGCCTGGC CGTGCCGCTC
GACTACCAGG CCGCCGTGAT CCGCGCGGCG ATCACGCTGA AGCTGTGCAT GCACGAGGAG
ACCGGGGCGA TCGTGGCGGC CATGACCACC TCGATCCCCG AGCACGCCGA CAGCGGCCGC
AACTGGGACT ATCGCTACTG CTGGCTGCGC GACGCCTATT ACGTGGTCCA AGCGCTGAAC
CGCCTGGGCG CGGTGGACAT CCTGGAGAAC TATCTGGGCT ATCTGCGCAA CATCGTCGAC
CGGGCGGCGG GCGGCCACAT CCAGCCGCTG TTCGGCGTGG GGTTCGAGCC GCAGCTGACC
GAGCGCTTCG CCCCCGCCCT GCCCGGCTAT CGCGGCATGG GACCCGTGCG CATCGGCAAC
CAGGCCTTCG AGCACCAGCA GCACGACGTC TACGGCCAGA TCATCCTGTC GACCGTCCAG
GCCTTCTTCG ACGAGCGCCT GCTGAGGCCT GGCACGGTCG AGGACTTCCA CAATCTCGAA
CCTGTCGGCG AGCGGGCCTT CCAGCTTCAT GACCAGCCCG ACGCCAGCCT GTGGGAGTTC
CGCGGCCGGG CCAATGTCCA CACCTATTCG GCGGTGATGT GCTGGGCGGC CTGCGACCGG
CTGGGCAACG CCGCCGCCCG CCTGGGCCTG ACCGAACGCG CCGACTTCTG GAACGCGCGC
GCCGCCCAGG TGCGCGCCAC CATCGACGAG CGGGCCTGGA ACGAGGAACT GGGCCGCTTC
GCCGCCACCT TCGAAGGCGA CGAACTGGAC GCCAGCCTGC TGCAACTGGT CGATCTGCGC
TTCATCGAGG CCAACGACCC GCGCAACGTG GCCACCGTCG CCGCCGTCGA GGCGGGCCTG
CGCAAGGGCT CCTACCTGCT GCGCTACGCC ATCCCCGACG ACTTCGGCGC GCCGCAGACG
GCGTTCAACA TCTGCACCTT CTGGCTGGTC GAGGCCCTCT ACCTGGCCGG CCGCATCGAC
GAGGCCCGCG ACCTGTTCGA GGAGATGCTG GCGCGCCGCA CCACGGCTGG CCTGCTCTCG
GAAGACATAG GCTTCGCGGA CGGCGAGCTG TGGGGCAACT ATCCGCAGAC CTACTCGCTG
GTCGGGCTGA TCAATTGCGC GGTGCTGCTG AGCAGGCCTT GGACGTCGGT GCGCTGA
 
Protein sequence
MPQNLDLFPI GNCAVSALID RAGRFVWACA PRIDSDPVFS ALLGGLEPGD PTARGTWEVA 
VDGAKTVEQA YLRNTPILRT VITDADGASL EILDFAPRYQ QYGRSFRPTA FIRLIRPLTS
VARITIRLRP TADWGARAAE TTHGSNHIRY LCSDMTLRLS TDGPVSHVLE ERAFRLEKPI
AMFLGADEGF NADIGATCNR MLQQTQEYWM DWVRGLAVPL DYQAAVIRAA ITLKLCMHEE
TGAIVAAMTT SIPEHADSGR NWDYRYCWLR DAYYVVQALN RLGAVDILEN YLGYLRNIVD
RAAGGHIQPL FGVGFEPQLT ERFAPALPGY RGMGPVRIGN QAFEHQQHDV YGQIILSTVQ
AFFDERLLRP GTVEDFHNLE PVGERAFQLH DQPDASLWEF RGRANVHTYS AVMCWAACDR
LGNAAARLGL TERADFWNAR AAQVRATIDE RAWNEELGRF AATFEGDELD ASLLQLVDLR
FIEANDPRNV ATVAAVEAGL RKGSYLLRYA IPDDFGAPQT AFNICTFWLV EALYLAGRID
EARDLFEEML ARRTTAGLLS EDIGFADGEL WGNYPQTYSL VGLINCAVLL SRPWTSVR