Gene Caul_2412 details


Gene Information

Locus tag: Caul_2412
Symbol:
ID: 5899867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2633003
End bp: 2634898
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 67%
IMG OID: 641562903
Product: Alpha-galactosidase
Protein accession: YP_001684037
Protein GI: 167646374
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
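
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is the reading that reproduces the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2633003, 2634898)
print(length)                     # 1896, matching "Gene Length"
print(protein_length_aa(length))  # 631, matching "Protein Length"
```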


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00268404
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.284448
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTT TCCGTATCGC CGCCGCCGTG GCCGCCACGC TCGTCATGTC GGCGACGCAG 
GCCGTCGCAG ATCCCCTCGC GCCCGTCGCG CGCTGGACCG CCTACGAGCG CGCCGCGGCG
CGCACCCCGC CGATGGGCTG GAACAGCTGG AACGCCTTCA CCAGCGACAT CGACGAAGAG
AAGATCATGG GCTCGGCCCG GATCCTGGTG AAGGCCGGCC TGGCGGATCG GGGCTATCGC
TACGTGAACA TCGATGACGG CTGGTGGCTC AAACGCCGCG CGTCGGACGG GCGCATGCTC
ATCCGCGCCG AGCGTTTCCC ATCGGCGGTG ACGGCCGACG GCGGGACCAG CTTTCGCCCC
CTGACCGATC GCCTCCACGC GATGGACCTC AAGGCCGGCA TCTATTCCGA CATCGGCCGC
AACAGCTGCG GTCAGGTGTT CACCTCCACG TTTCCAAACC AGCCGGAGGG CGACATCCGG
GAACGAGAAG TCGGTCTTTA CGGCCACGTC GACCAGGACA TCGCCCTCTA TTTCAAGGAC
TGGGGATTCG ATCTGATCAA GGTCGACGGC TGCGGCGTGC GCGGCTTGCC GGCCTCCGAT
CCCCGGGTGA AGGCCGGGCT CTATCGCGCG CTGGGTCCGC TGGTTGACGT CGACTCCCTG
GGAAGGACGG ACGTTCCGGC CGTCCGGGAT CTCTACAAGG CGGTGGGCGC CGCCCTGGAT
CGCTCCAACC CCGACGGCGA CTTCGTCTAT TCCATCTGCC TCTGGGGCGC GGCCGACGTT
CGCGCCTGGG GCAAGGATGT CGGCGCGATC TCGCGAACCA GCGAGGACAT TTCGCCGACC
TGGAGCCGGA TGCTGCATAA TCTCGACAGC GTCTCCCGGC GCGCCCTCTA TGCGCATCCA
GGCTCCTGGA ACGATCCAGA CATGCTCTAC GTGGGCAAGG GCGATTTCGA CGAAGCGCAT
CTGGTCGAGG CCCGCTCGCA TTTCGCGCTT TGGGCGATGG TCAACGCGCC GCTGATCATC
GGCTACGACT TGCGCACGGC CGCGCCGGCG CTTCTGGACA TCCTGGGCGC CAAGGAGATC
ATCGCGCTCA ATCAGGACTC GGCGGGCAAT CAGGCCGTGC TGGCTTTCGA TTCCGCCGAC
GTCTCCATCT TCGTCAAGAC ACTGGCCGGT GGCGACAAGG CGGTGGCGAT CCTCAATCGC
ACGGCCGCGC CCGCCGAGGC GGTGCTGACC GCGGATCATC TGAAGCTGCT GGGTACGGCC
GATGTCGAAC TGACGGACCT GTGGTCAGGC GCGGCCACCC GTTTTCGCGC CGAACGAAAG
TTCCAGCTGG CGCCGCGCCA GACGTTGATC TTCCGGGCCA AGGGCGCGCG CAAGCTGGCC
GACGGCGTCT TTCTCTCCGA ACAGCCCGGG TCGGTGAATC CCGCCGTCGA CGGCGTTGTG
ATCCCCCAAG CCGACCCGCT GATCCATCGT GCGATCCTGC CCTGGCGCGG AACACGCGGC
GTCGGCGAAC CCCCGCGCTA CGGGGGCTGG GGCGGGGCTC AAGCCGACCG CACGCCCTAT
GACCAGGAAC TGGCGATCGC CGGTCGGCGG TTCGACACTG GCCTCGGCGT CCTGGCCAAT
TCGCGCTTCG AAGTGCGCAA CGGCGGCTTT CGCCGCTTCA CCGCCAGCGT CGGCGTCGAC
GACTCCGCCG AGGATCGGTC ACGGCCCGTG ACCTTCTTCG TCTATGGCGA CGGCAAGCTT
CTGGCGCGCT CGCGGCCGGC GAGCTTCGGC CAGCCGCCGC AGGACCTCAG CGTCGAGGTG
TCGGGCGTCA AGTTGCTTGA ATTGGTCGCG CGCGTTTCTG GCCAATCGCG CCACCCAGAT
TCAGTAACCT GGGGCGACGC GGCGCTGCAT CGTTAG
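
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field; the helper names are hypothetical, and the calculation (G plus C over total bases) is an assumed definition rather than the database's documented method.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence as printed above.
last_line = "TCAGTAACCT GGGGCGACGC GGCGCTGCAT CGTTAG"
print(len(clean_sequence(last_line)))  # 36
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content.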
 
Protein sequence
MTFFRIAAAV AATLVMSATQ AVADPLAPVA RWTAYERAAA RTPPMGWNSW NAFTSDIDEE 
KIMGSARILV KAGLADRGYR YVNIDDGWWL KRRASDGRML IRAERFPSAV TADGGTSFRP
LTDRLHAMDL KAGIYSDIGR NSCGQVFTST FPNQPEGDIR EREVGLYGHV DQDIALYFKD
WGFDLIKVDG CGVRGLPASD PRVKAGLYRA LGPLVDVDSL GRTDVPAVRD LYKAVGAALD
RSNPDGDFVY SICLWGAADV RAWGKDVGAI SRTSEDISPT WSRMLHNLDS VSRRALYAHP
GSWNDPDMLY VGKGDFDEAH LVEARSHFAL WAMVNAPLII GYDLRTAAPA LLDILGAKEI
IALNQDSAGN QAVLAFDSAD VSIFVKTLAG GDKAVAILNR TAAPAEAVLT ADHLKLLGTA
DVELTDLWSG AATRFRAERK FQLAPRQTLI FRAKGARKLA DGVFLSEQPG SVNPAVDGVV
IPQADPLIHR AILPWRGTRG VGEPPRYGGW GGAQADRTPY DQELAIAGRR FDTGLGVLAN
SRFEVRNGGF RRFTASVGVD DSAEDRSRPV TFFVYGDGKL LARSRPASFG QPPQDLSVEV
SGVKLLELVA RVSGQSRHPD SVTWGDAALH R
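
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch using only the standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it assumes the input is in frame and begins with ATG, as this gene does. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 shares these
# amino-acid assignments with the standard code; alternative start codons
# are NOT handled here (assumption: the CDS starts with ATG).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(text: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(text.split()).upper()  # strip block spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene; the trailing partial codon is ignored.
print(translate("ATGACTTTTT TCCGTATCGC"))  # MTFFRI
# Last 36 bases of the gene, which are in frame and end with the TAG stop.
print(translate("TCAGTAACCT GGGGCGACGC GGCGCTGCAT CGTTAG"))  # SVTWGDAALHR
```

Both outputs agree with the start and end of the protein sequence listed above.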