Gene Caul_3397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3397 
Symbol 
ID5900852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3667966 
End bp3669414 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content71% 
IMG OID641563903 
Productbeta-galactosidase 
Protein accessionYP_001685022 
Protein GI167647359 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0879388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGATA AAGGCATTAG CCGTCGGGCG CTGGGGGCGC TCGCGGCGGG CGGCGTGGCG 
AGCGTCGGGC TGAGCGGCTG CGATCGGGTC GGCGCGACCG AGGCGACGGT CAAGTCACGG
CAATTTCCGG CGGATTTCGT CTGGGGCGTG GCCACGGCGG CCTTCCAGAC CGAGGGCTCG
CCGACCGCCG ACGGGCGCGG ACCCAGCATC TGGGACACCT TCCAGAACCA GCCGGGCCGC
ATCAAGGACG GCTCCACCGC CGACGTCGCC ACCGACAGCT ATCGCCGCTA CGCCGAGGAT
GTCGATCTGA TCGCCGGGGC GGGATTGAAG GCCTTCCGCT TCTCGATCGC CTGGTCGCGG
GTGCTGCCGA CCGGGGAGGG CACGGTCAAC GCGGCGGGGC TGGACCACTA TGACCGCCTG
GTCGACGCCT GCCTGGCCAA GGGGATCACC CCCTACGCCA CGCTGTTTCA CTGGGACCTG
CCCCAGGCCC TGCAGGACAA GGGCGGCTGG AGCGCGCGCG ACACCGCCAG CAGCTTTGGC
GACTACGCCG CCGCCGTGGC GGCGCGGCTG GGCGACCGGC TCAAGCACGT CATCACCCTG
AACGAACCGG CCGTGCACAC GGTGTTCGGC CACGTGCTGG GCGAGCATGC CCCGGGGCTG
AAGGACATCG CCCTGCTGGG GCCGACCACC CACCACATGA ACCTCGGACA GGGGCTGGCG
ATCCAGGCCC TGCGCGCGGC GCGCGGCGAC CTGCGGATCG GCACGACCCA GGCCTTGCAG
CCCTGCCGGG CGTCGGGCGG GCCGCTGGCG TTCTGGAACC GTCCGGCGGC GGACGGGCTG
GACGCCCTGT GGAACCGCGC CTGGCTGGAT CCGCTGCTGA AGGGGACCTA TCCGGCCCTG
ATGGACGACT TCCTCAAGGG CCATGTCCGC GACGGCGACC TGAAGACCAT CCGCCAGCCG
ATCGACTTCC TGGGGGTCAA TTACTATGCG CCGGCCTATG TGAAGCTGGA CCTCGGCAAC
GCCAGCCACA TCGCGCCGGG CTCGCCACCG AGGGGCGCGG AGCTGGACGC CTTCGGCCGC
CAGATCGATC CTTCGGGCCT GGTCCAGGTG CTCGAGATGG TGCGCCGCGA CTACGGCAAT
CCGCCGGTGC TGATCACCGA GAACGGCTGC TCGGACCCGT TCGGACCTGG TCCGGGCGTG
ATCGACGACG GCTTCCGCGG CCAATACCTG CGCCGGCACC TGGAGGCGGT GAAGAGCGCG
ACGGAGGCCG GTTCGCGGAT CGGCGGCTAT TTCACCTGGA CCCTGGTCGA CAACTGGGAG
TGGGACCTGG GCTACACGTC AAAGTTCGGC CTGGTGTCGC TGGACCGCGC GACGGGCGCG
CGGACACCCA AGGCGTCGTA TGGCTGGTTC AAGGGCGTGG CGGAGAGCGG GCTGCTCCCC
GCCGCCTGA
 
Protein sequence
MGDKGISRRA LGALAAGGVA SVGLSGCDRV GATEATVKSR QFPADFVWGV ATAAFQTEGS 
PTADGRGPSI WDTFQNQPGR IKDGSTADVA TDSYRRYAED VDLIAGAGLK AFRFSIAWSR
VLPTGEGTVN AAGLDHYDRL VDACLAKGIT PYATLFHWDL PQALQDKGGW SARDTASSFG
DYAAAVAARL GDRLKHVITL NEPAVHTVFG HVLGEHAPGL KDIALLGPTT HHMNLGQGLA
IQALRAARGD LRIGTTQALQ PCRASGGPLA FWNRPAADGL DALWNRAWLD PLLKGTYPAL
MDDFLKGHVR DGDLKTIRQP IDFLGVNYYA PAYVKLDLGN ASHIAPGSPP RGAELDAFGR
QIDPSGLVQV LEMVRRDYGN PPVLITENGC SDPFGPGPGV IDDGFRGQYL RRHLEAVKSA
TEAGSRIGGY FTWTLVDNWE WDLGYTSKFG LVSLDRATGA RTPKASYGWF KGVAESGLLP
AA