Gene Caul_0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0144 
Symbol 
ID5897856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp159878 
End bp161485 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content70% 
IMG OID641560629 
ProductAlpha,alpha-trehalase 
Protein accessionYP_001681780 
Protein GI167644117 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.994259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCATC GCCGCCTACT CGCCTCCGCA ATCTGGCTGG CCATCGCCTC CCCGGCCATC 
GCCCAGACCG CGACGCCCAA GACCAGCGAG ATCCCTTCTC CCGCCGACAC CTATGCCGAG
CTGTTCCACC AGGTGCAGAT GCGCAAGCTG TTCCCGGACG GCAAGACCTT CGTCGACGCC
ACGCCGAAGC GTCAGCCGGG CCAGATCCTG GCCGCCTACC GCGCCCACGC CGCCTTCACC
GACGCCGAGT TGAAGCGCTT CGTGCGGGCC AACTTCGCCG TACCGGAAAG CGCGCCGCTG
CCGTCCCCCT CCAAGGACCG CACCACCCTG AAGGCCCACA TCGCCGCCCT GTGGCCGGTG
CTGACCCGCC CGCCGGTCAA GGCGGTGGAG GGCGACAGCG CCCTGCCGCT GGACAAGCCG
TTCGTGGTGC CGGGCGGGCG CTTTCGCGAG ATGTATTACT GGGACAGCTA TTTCACGCTG
CTGGGCCTGG CCGCCGACGG CAAGACCGAG GCGGTCGAGA ACATGGTCGA TGATTTCGGC
GGGCTGATCG ACCGCTACGG ACACATCCCC AACGGCACGC GCACCTACTA TCTCAGCCGC
TCGCAGCCGC CGTTCTACTT CGCCATGGTG GGCCTCGCTC AGAAAGATGG GGCCGACAAG
ACCGACAAGG CGCGGTTCAA GGCCCGCCTC GACCTGATGC GCCGCGAGCA CGCCTTCTGG
ATGGACGGCG AAAAGAGCGT GAGGCCGGGC CAAGCCTGGC GCCGCGTTGT CGCCCTGCCC
GACGGGGCGA TCCTGAATCG CTACGCCGAC GACCGCGCCA CCCCGCGCGA CGAGTCCTAT
CGCGAGGACG TGCTGACGGC GCGGGAAGTC ACCAGCCGTC CGGCCGGCGA TGTCTTCCGC
GACCTGCGCT CGGGGGCCGA GAGCGGCTGG GACTTCAGCT CGCGCTGGAT GGCCGACGGC
CAGAGCCTGA AGACCATCCA GACCACCAGC ATCGTGCCGG TCGACCTCAA CAGCCTGATG
TACGGCCTGG AGACGGCGAT CGCCCAGGGC TGCGCCGAGC TGGTGGACGC GCCCTGCGTC
GCCGAGTTCA GCGACCGCGC CAAGGCCCGC AAGACGGCCA TGGACGCCTA TCTCTGGGAC
GCCCCGCGCG GCCTGTATCT CGACTACCAG TGGCGCGACC ATGGACGGCT GGATCACCCC
AGCGCCGCGA CGCTGTATCC CCTGTTCGTC GGCGCCGCGA GCCCGGACCA GGCCAGGGCC
GTGGCGGCGA CGACCCGCGC CCTGCTGCTG GCCCCCGGCG GCCTGCGCAC CACCACCGCC
TCGACCGGTC AGCAATGGGA TACACCCAAC GGCTGGGCTC CCCTGCAATG GGTGGCGGTG
TCGGGCCTGC GCCGCTATGG CGAGGAGGCC CTGGCCAGGG ATATCGGCCA GCGCTGGCTG
GCCACCGTCC AGCGCGAGTA CCAGGCCAGC GGCAAGATGC TGGAGAAGTA CGACGTCGAG
GAGGCCAAGG CCGGGGGCGG CGGCGAGTAT CCGTTGCAGG ACGGCTTTGG TTGGACCAAC
GGGGTGACGC GGGCGCTGCT CGATCTCTAT CCGGCGGCGG CGCACTAG
 
Protein sequence
MKHRRLLASA IWLAIASPAI AQTATPKTSE IPSPADTYAE LFHQVQMRKL FPDGKTFVDA 
TPKRQPGQIL AAYRAHAAFT DAELKRFVRA NFAVPESAPL PSPSKDRTTL KAHIAALWPV
LTRPPVKAVE GDSALPLDKP FVVPGGRFRE MYYWDSYFTL LGLAADGKTE AVENMVDDFG
GLIDRYGHIP NGTRTYYLSR SQPPFYFAMV GLAQKDGADK TDKARFKARL DLMRREHAFW
MDGEKSVRPG QAWRRVVALP DGAILNRYAD DRATPRDESY REDVLTAREV TSRPAGDVFR
DLRSGAESGW DFSSRWMADG QSLKTIQTTS IVPVDLNSLM YGLETAIAQG CAELVDAPCV
AEFSDRAKAR KTAMDAYLWD APRGLYLDYQ WRDHGRLDHP SAATLYPLFV GAASPDQARA
VAATTRALLL APGGLRTTTA STGQQWDTPN GWAPLQWVAV SGLRRYGEEA LARDIGQRWL
ATVQREYQAS GKMLEKYDVE EAKAGGGGEY PLQDGFGWTN GVTRALLDLY PAAAH