Gene Caul_3276 details


Gene Information

Locus tag: Caul_3276
Symbol:
ID: 5900731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3538344
End bp: 3539549
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 64%
IMG OID: 641563782
Product: carbohydrate-binding family 6 protein
Protein accession: YP_001684901
Protein GI: 167647238
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
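
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3538344, 3539549)
print(length)                  # 1206
print(protein_length(length))  # 401
```

Both results agree with the values listed in the record (1206 bp and 401 aa).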


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACT GGCGCCTGTA TAGCTCGACG GACATGGTCA ACTGGACCGA TCGGGGCGTG 
GTGGCCTCGC TCAAGACCTT CCCTTGGGCC GTGCAGACCA ACGACGCCTG GGCGCCGCAG
GTCATCGCCC GCGACGGCAA GTTCTATCTC TTTGTCCCGA TCAGCGTCGC CGGCTCGCCC
AAGAACGTCA TCGCCGTGGC GGTGGCGGAT AATCCCGCGG GCCCGTTCAA GGACGCGCTC
GGCAAGCCGT TGATCGGGCC GGCTCGCGAC AACATCGATC CCACGGCGTG GATCGACGAC
GATGGCCAAG CCTATCTCTA CTGGGGCAAC CCGAACCTTT GGTACGTCAA GCTGAACAAG
GACATGGTGT CCTATTCAGG TCCTATCACC AAGATCGAGC CTGGCGTCCG CGACTACCAA
GAGGGGCCTT GGTTCTACAA ACGCACCGGC CGCTACTACA TGGCGTTCGC CTCGACCTGC
TGTCCGGAGG GTATCGGCTA TGCGATGAGC GACAAGCCGA CGGGGCCCTG GGCCTACAAG
GGTCCGATCA TGGACCATGA CCCGCGCGCG ACCGGCAATC ATCCAGGGAT CATCGACTAT
AAGGGCGGCT CCTATGTCTT CGGCTTCAGC TACGAACACA ATTTCGCGCT GACCCCGATC
CATCACGAAC GCCGATCCGT GTCGGTGGCC AAGTTCGACT ACAACGCCGA CGGAACCATT
CCGAACCTTG GCTGGTGGGA CAAGACCAGC GCGCCGCAGA TCGGGACGCT TGACCCTTAC
AAGCGCGTGG AGGCCGAAAC GATCGCCTGG ACGTCGCGTC TCAAGCGAGA TCGCGATCGT
CCCTACGCCT GGGCGCCGGG CGTGACGACG GCGCAGGACG ATCGCGCGGG CGTGTACGTC
ACGCGGATCA CAGACCGCAG CTACATCAAG GTCGCCGGCG TCGACTTCGG CCAGACCGGC
GCCAAGACCT TCGTCGCCAG CCTGGCCAAC GAACGGCCCG GCGGGGCCAT CGAGTTGAGA
CTCGACAGGG TCGACGGCCC GGTGATCGGC ACGGTGCAGG TCGGGACGAC CGGCGCGGCG
GGCCAATGGC GCGAGCTAGG GACCCCCGTC TCGGTGGCGA CGGGGGTGCG CGACCTCTTC
CTGGTGTTCA AGGGATCGGG CGACAACCCG ATGTTCGATT TCGACTACTG GCGCTTCGAG
CAGTGA
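
The gene sequence above is laid out in blocks of ten bases, so whitespace has to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and the fragment used is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
fragment = clean_sequence("ATGCTCGACT GGCGCCTGTA")
print(len(fragment), gc_percent(fragment))  # 20 60.0
```

Passing the full gene sequence to `gc_percent` would give the figure to compare against the rounded GC content value in the record.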
 
Protein sequence
MLDWRLYSST DMVNWTDRGV VASLKTFPWA VQTNDAWAPQ VIARDGKFYL FVPISVAGSP 
KNVIAVAVAD NPAGPFKDAL GKPLIGPARD NIDPTAWIDD DGQAYLYWGN PNLWYVKLNK
DMVSYSGPIT KIEPGVRDYQ EGPWFYKRTG RYYMAFASTC CPEGIGYAMS DKPTGPWAYK
GPIMDHDPRA TGNHPGIIDY KGGSYVFGFS YEHNFALTPI HHERRSVSVA KFDYNADGTI
PNLGWWDKTS APQIGTLDPY KRVEAETIAW TSRLKRDRDR PYAWAPGVTT AQDDRAGVYV
TRITDRSYIK VAGVDFGQTG AKTFVASLAN ERPGGAIELR LDRVDGPVIG TVQVGTTGAA
GQWRELGTPV SVATGVRDLF LVFKGSGDNP MFDFDYWRFE Q
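
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a plain-Python illustration, not the tool that produced this record. It uses the standard codon assignments, which translation table 11 shares with table 1; table 11 differs in its set of permitted start codons, which this sketch does not model (this gene begins with ATG, so that does not matter here). The `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCTCGACT GGCGCCTGTA TAGCTCGACG"))  # MLDWRLYSST
```

The output matches the first block of the protein sequence, and translating the final codons `CAGTGA` yields the terminal `Q` followed by the stop.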