Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1778 |
Symbol | |
ID | 5899233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1878025 |
End bp | 1879821 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562268 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001683405 |
Protein GI | 167645742 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.743693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0628922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCAGA ACCTCGACCT CTTCCCGATC GGCAACTGCG CGGTCAGCGC CCTGATCGAC CGTGCGGGCC GCTTCGTCTG GGCCTGCGCG CCGCGCATCG ATTCCGACCC GGTGTTCAGC GCGCTGCTGG GCGGGTTGGA GCCGGGCGAT CCCACCGCCC GGGGCACGTG GGAGGTCGCC GTCGACGGCG CCAAGACCGT CGAACAGGCG TACCTGCGCA ACACCCCGAT CCTGCGCACG GTGATCACCG ACGCCGACGG CGCCAGCCTC GAAATTCTCG ATTTCGCCCC GCGCTACCAG CAGTACGGCC GCAGCTTCCG TCCAACCGCC TTCATCCGCC TGATCCGTCC GCTGACCAGT GTCGCGCGCA TCACGATCCG CCTGCGCCCG ACCGCCGACT GGGGCGCGCG GGCGGCCGAG ACCACCCACG GCTCGAACCA CATTCGCTAC CTCTGCTCCG ACATGACCTT GCGCCTGTCG ACCGACGGGC CGGTGTCCCA CGTGCTGGAG GAGCGCGCCT TCCGACTGGA AAAGCCGATC GCCATGTTCC TGGGCGCCGA CGAGGGCTTC AACGCCGACA TCGGGGCCAC CTGCAACCGC ATGCTACAGC AGACCCAGGA ATACTGGATG GACTGGGTGC GGGGCCTGGC CGTGCCGCTC GACTACCAGG CCGCCGTGAT CCGCGCGGCG ATCACGCTGA AGCTGTGCAT GCACGAGGAG ACCGGGGCGA TCGTGGCGGC CATGACCACC TCGATCCCCG AGCACGCCGA CAGCGGCCGC AACTGGGACT ATCGCTACTG CTGGCTGCGC GACGCCTATT ACGTGGTCCA AGCGCTGAAC CGCCTGGGCG CGGTGGACAT CCTGGAGAAC TATCTGGGCT ATCTGCGCAA CATCGTCGAC CGGGCGGCGG GCGGCCACAT CCAGCCGCTG TTCGGCGTGG GGTTCGAGCC GCAGCTGACC GAGCGCTTCG CCCCCGCCCT GCCCGGCTAT CGCGGCATGG GACCCGTGCG CATCGGCAAC CAGGCCTTCG AGCACCAGCA GCACGACGTC TACGGCCAGA TCATCCTGTC GACCGTCCAG GCCTTCTTCG ACGAGCGCCT GCTGAGGCCT GGCACGGTCG AGGACTTCCA CAATCTCGAA CCTGTCGGCG AGCGGGCCTT CCAGCTTCAT GACCAGCCCG ACGCCAGCCT GTGGGAGTTC CGCGGCCGGG CCAATGTCCA CACCTATTCG GCGGTGATGT GCTGGGCGGC CTGCGACCGG CTGGGCAACG CCGCCGCCCG CCTGGGCCTG ACCGAACGCG CCGACTTCTG GAACGCGCGC GCCGCCCAGG TGCGCGCCAC CATCGACGAG CGGGCCTGGA ACGAGGAACT GGGCCGCTTC GCCGCCACCT TCGAAGGCGA CGAACTGGAC GCCAGCCTGC TGCAACTGGT CGATCTGCGC TTCATCGAGG CCAACGACCC GCGCAACGTG GCCACCGTCG CCGCCGTCGA GGCGGGCCTG CGCAAGGGCT CCTACCTGCT GCGCTACGCC ATCCCCGACG ACTTCGGCGC GCCGCAGACG GCGTTCAACA TCTGCACCTT CTGGCTGGTC GAGGCCCTCT ACCTGGCCGG CCGCATCGAC GAGGCCCGCG ACCTGTTCGA GGAGATGCTG GCGCGCCGCA CCACGGCTGG CCTGCTCTCG GAAGACATAG GCTTCGCGGA CGGCGAGCTG TGGGGCAACT ATCCGCAGAC CTACTCGCTG GTCGGGCTGA TCAATTGCGC GGTGCTGCTG AGCAGGCCTT GGACGTCGGT GCGCTGA
|
Protein sequence | MPQNLDLFPI GNCAVSALID RAGRFVWACA PRIDSDPVFS ALLGGLEPGD PTARGTWEVA VDGAKTVEQA YLRNTPILRT VITDADGASL EILDFAPRYQ QYGRSFRPTA FIRLIRPLTS VARITIRLRP TADWGARAAE TTHGSNHIRY LCSDMTLRLS TDGPVSHVLE ERAFRLEKPI AMFLGADEGF NADIGATCNR MLQQTQEYWM DWVRGLAVPL DYQAAVIRAA ITLKLCMHEE TGAIVAAMTT SIPEHADSGR NWDYRYCWLR DAYYVVQALN RLGAVDILEN YLGYLRNIVD RAAGGHIQPL FGVGFEPQLT ERFAPALPGY RGMGPVRIGN QAFEHQQHDV YGQIILSTVQ AFFDERLLRP GTVEDFHNLE PVGERAFQLH DQPDASLWEF RGRANVHTYS AVMCWAACDR LGNAAARLGL TERADFWNAR AAQVRATIDE RAWNEELGRF AATFEGDELD ASLLQLVDLR FIEANDPRNV ATVAAVEAGL RKGSYLLRYA IPDDFGAPQT AFNICTFWLV EALYLAGRID EARDLFEEML ARRTTAGLLS EDIGFADGEL WGNYPQTYSL VGLINCAVLL SRPWTSVR
|
| |