Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0144 |
Symbol | |
ID | 5897856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 159878 |
End bp | 161485 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641560629 |
Product | Alpha,alpha-trehalase |
Protein accession | YP_001681780 |
Protein GI | 167644117 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.994259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCATC GCCGCCTACT CGCCTCCGCA ATCTGGCTGG CCATCGCCTC CCCGGCCATC GCCCAGACCG CGACGCCCAA GACCAGCGAG ATCCCTTCTC CCGCCGACAC CTATGCCGAG CTGTTCCACC AGGTGCAGAT GCGCAAGCTG TTCCCGGACG GCAAGACCTT CGTCGACGCC ACGCCGAAGC GTCAGCCGGG CCAGATCCTG GCCGCCTACC GCGCCCACGC CGCCTTCACC GACGCCGAGT TGAAGCGCTT CGTGCGGGCC AACTTCGCCG TACCGGAAAG CGCGCCGCTG CCGTCCCCCT CCAAGGACCG CACCACCCTG AAGGCCCACA TCGCCGCCCT GTGGCCGGTG CTGACCCGCC CGCCGGTCAA GGCGGTGGAG GGCGACAGCG CCCTGCCGCT GGACAAGCCG TTCGTGGTGC CGGGCGGGCG CTTTCGCGAG ATGTATTACT GGGACAGCTA TTTCACGCTG CTGGGCCTGG CCGCCGACGG CAAGACCGAG GCGGTCGAGA ACATGGTCGA TGATTTCGGC GGGCTGATCG ACCGCTACGG ACACATCCCC AACGGCACGC GCACCTACTA TCTCAGCCGC TCGCAGCCGC CGTTCTACTT CGCCATGGTG GGCCTCGCTC AGAAAGATGG GGCCGACAAG ACCGACAAGG CGCGGTTCAA GGCCCGCCTC GACCTGATGC GCCGCGAGCA CGCCTTCTGG ATGGACGGCG AAAAGAGCGT GAGGCCGGGC CAAGCCTGGC GCCGCGTTGT CGCCCTGCCC GACGGGGCGA TCCTGAATCG CTACGCCGAC GACCGCGCCA CCCCGCGCGA CGAGTCCTAT CGCGAGGACG TGCTGACGGC GCGGGAAGTC ACCAGCCGTC CGGCCGGCGA TGTCTTCCGC GACCTGCGCT CGGGGGCCGA GAGCGGCTGG GACTTCAGCT CGCGCTGGAT GGCCGACGGC CAGAGCCTGA AGACCATCCA GACCACCAGC ATCGTGCCGG TCGACCTCAA CAGCCTGATG TACGGCCTGG AGACGGCGAT CGCCCAGGGC TGCGCCGAGC TGGTGGACGC GCCCTGCGTC GCCGAGTTCA GCGACCGCGC CAAGGCCCGC AAGACGGCCA TGGACGCCTA TCTCTGGGAC GCCCCGCGCG GCCTGTATCT CGACTACCAG TGGCGCGACC ATGGACGGCT GGATCACCCC AGCGCCGCGA CGCTGTATCC CCTGTTCGTC GGCGCCGCGA GCCCGGACCA GGCCAGGGCC GTGGCGGCGA CGACCCGCGC CCTGCTGCTG GCCCCCGGCG GCCTGCGCAC CACCACCGCC TCGACCGGTC AGCAATGGGA TACACCCAAC GGCTGGGCTC CCCTGCAATG GGTGGCGGTG TCGGGCCTGC GCCGCTATGG CGAGGAGGCC CTGGCCAGGG ATATCGGCCA GCGCTGGCTG GCCACCGTCC AGCGCGAGTA CCAGGCCAGC GGCAAGATGC TGGAGAAGTA CGACGTCGAG GAGGCCAAGG CCGGGGGCGG CGGCGAGTAT CCGTTGCAGG ACGGCTTTGG TTGGACCAAC GGGGTGACGC GGGCGCTGCT CGATCTCTAT CCGGCGGCGG CGCACTAG
|
Protein sequence | MKHRRLLASA IWLAIASPAI AQTATPKTSE IPSPADTYAE LFHQVQMRKL FPDGKTFVDA TPKRQPGQIL AAYRAHAAFT DAELKRFVRA NFAVPESAPL PSPSKDRTTL KAHIAALWPV LTRPPVKAVE GDSALPLDKP FVVPGGRFRE MYYWDSYFTL LGLAADGKTE AVENMVDDFG GLIDRYGHIP NGTRTYYLSR SQPPFYFAMV GLAQKDGADK TDKARFKARL DLMRREHAFW MDGEKSVRPG QAWRRVVALP DGAILNRYAD DRATPRDESY REDVLTAREV TSRPAGDVFR DLRSGAESGW DFSSRWMADG QSLKTIQTTS IVPVDLNSLM YGLETAIAQG CAELVDAPCV AEFSDRAKAR KTAMDAYLWD APRGLYLDYQ WRDHGRLDHP SAATLYPLFV GAASPDQARA VAATTRALLL APGGLRTTTA STGQQWDTPN GWAPLQWVAV SGLRRYGEEA LARDIGQRWL ATVQREYQAS GKMLEKYDVE EAKAGGGGEY PLQDGFGWTN GVTRALLDLY PAAAH
|
| |