Gene Ccur_05950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_05950 
Symbol 
ID8374803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp699180 
End bp701441 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content51% 
IMG OID644993518 
Productglucan-binding domain-containing protein 
Protein accessionYP_003150994 
Protein GI256827035 
COG category[R] General function prediction only 
COG ID[COG5263] FOG: Glucan-binding domain (YG repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000330124 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.0000000000107499 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGAG AAATCTCACG TCGTGGATTT GTGATTGGTG GAGCCTATGC TGGCATGGCA 
GCAGCAACGG GAGCATTCAT GTTTAGTCCT TCAGCTGCTT TTGCTGCTGA TGTAACGCGT
ATTCATGTGA TGGCGTTTAC CGACATGGAT GCCATCGTGC TTGAAAGCAA TGGGCGGTTT
GCGATGGTCG ATTCGGGGGA GGATAGTACC TATCCCGGTG GATCAGATGC GCGGTACCCT
TGGCGTGATG GTATTACTAA GGGAAACGGC CATGAAAACG AAGTTATTTC TTACTTGCAT
AGCCTTGGTG CCACTGAGTC GAACTTCGAA TTTTATCTCG GTACTCACCC GCATAGCGAC
CACATTGGAT CGGCGAGCCA GGTTATTAAT GAGTTTCGAC CCAAACGTGT TTATACGCCC
GAATATGACG ATTCATACAT TACTGATTCC AATGCATTGT GGGATAACCA GTACTGCTAC
GATCGACTGG TAGAGGCGGC CCATAATGTA GGGGCAACTC TTATCACGAG CTTCGATACG
TCTGCGCCCA TTGACCCCGT TGCGGATGCT TCAGCTGCCC AGCGTAGCGC TTCTGTTGAA
ACTGATCCGT CAGCTGGAAC AGATGCCCCC GCTGCTGAAT CGCTTTCGCG TGAGGATATT
ATCGAGCGTT TTGGCGTTGA TCCTACCGAT CCGCATGACC CTAATAATTC ATCTGAACTG
CCCGATGGGC GAGTGAGGGC GGTTGTACCT GCCTCGCCGA ATGAACGCAG CGTTGCGTCG
AATTCGCAGA CCACAGGCAA TCCGGTGTTT TCTCTTGGCG CGATGACCAT TGAGGTTATG
AACTATGGCG ATGATTACAA ACTATATGGT CGTCCCGATT GCAATTGGTT TTCGCTGGGT
GTAAAGGTCA GCGCATATGG TAAGACGGCG TTTTTGGCGG GGGATATTAA CAATTACGAC
GGCGACGAAG ATCGTCTTGC TTCGCTTCTG GGACACGTTG ACTTTTTGAA AACGGGTCAT
CATGGCTTTT GGGGTTCAAA CACAACGAAT TACCTTCATG CTTTGAGTCC ACGCTATGCG
GTGCAAACCG GCCCGTACGA TAGGCCTGAG ACACAGGTAC TCGAAGATTT TTGCGATATG
GGTACCAAGT TTTATACGAC ACCCGATGTT GCAGCAGAAG GGTATCAGGC CGTTATTGCG
ACGTTCAATT CGTCAGAACT GCAGATAAAT GTTGCCGACG ACAGCACACA CTATCGTATG
CGTTCGTTAG CACCAGCGGT TGTTCGGTGG GGTGGCCATG CGGGTGTAGC AAGTACCGGA
TGGGAATACA TCGAAGGAGC GTGGTACTAC TTTGATGGTC ATGCGTATGC GATGCATGAG
TCGTGGAAAG AAATCGATGG CAACTGGTAC TTTTTTGGCG AGGACTCGAA AATGGTTACC
GGCTGGGTCG ATTGGGATGG CTGGTATTAC ACCGGCGTTG ATGGCATTAT GCAGACCGGC
TGGCAGAAGA TAAAGGACGA TTGGTATTGC TTTGATGAAA CGGGCCTCAT GTTGTCTGAT
GCGTGGTCGG GTAATTATTG GCTTGGTCAT GATGGTGTAA TGGCACGCAA TCAATGGGTC
GATGATGGGC ATTACTATGT GGGTGCTGAT GGCGCTTGGG TGCCCAGCCG CACGCGACCG
GTATGGCGCT CTGACACGGT TGGCTGGTGG CTTGAACATC CCGATGGTTC CTATCCGCAA
AACACATGGG AAGCAGTTGA TGGTAGCTGG TACTACTTTA GTGCACTGGG CTATGTTGCG
TCAGGCTGGC AGCAGGTAGG TGGTCTCTGG TTTTATCTCA ATCCAGATCA TGATGGCAAT
TTTGGCGTCA TGCAAACTGG TTGGCAAAAC ATAGCTGGCA GCTGGTATCA CTTCGATTCA
TCAGGCGCCA TGGAAACGGG TTGGATAGAC GATAGCGGCT TGTGGTATTA CCTGCGCGAA
GATGGCACGA TGGCCACTGG TTGGCTCTAC TGTGATGGTA GTTGGTACTA TCTAAAAGAG
GACGGTAGCA TGGCGCGTGG TTGGCTTTCG TACGGTGGCG CGTGGTATCT TTTTTCTTCA
AGTGGAGCTA TGCAAGTTGG CTGGCGTCAA GACGATGGTG CTTGGTACTT CTTTTCCGAT
TCTGGCGTCA TGCAAACCGG TTGGGTTGAT GATGTTGCTG CGGGGAAGTC GTATCATTTA
CGTTCCGACG GCACGTGGGA TGGCCAAAGC CGCGCACTGT GA
 
Protein sequence
MKREISRRGF VIGGAYAGMA AATGAFMFSP SAAFAADVTR IHVMAFTDMD AIVLESNGRF 
AMVDSGEDST YPGGSDARYP WRDGITKGNG HENEVISYLH SLGATESNFE FYLGTHPHSD
HIGSASQVIN EFRPKRVYTP EYDDSYITDS NALWDNQYCY DRLVEAAHNV GATLITSFDT
SAPIDPVADA SAAQRSASVE TDPSAGTDAP AAESLSREDI IERFGVDPTD PHDPNNSSEL
PDGRVRAVVP ASPNERSVAS NSQTTGNPVF SLGAMTIEVM NYGDDYKLYG RPDCNWFSLG
VKVSAYGKTA FLAGDINNYD GDEDRLASLL GHVDFLKTGH HGFWGSNTTN YLHALSPRYA
VQTGPYDRPE TQVLEDFCDM GTKFYTTPDV AAEGYQAVIA TFNSSELQIN VADDSTHYRM
RSLAPAVVRW GGHAGVASTG WEYIEGAWYY FDGHAYAMHE SWKEIDGNWY FFGEDSKMVT
GWVDWDGWYY TGVDGIMQTG WQKIKDDWYC FDETGLMLSD AWSGNYWLGH DGVMARNQWV
DDGHYYVGAD GAWVPSRTRP VWRSDTVGWW LEHPDGSYPQ NTWEAVDGSW YYFSALGYVA
SGWQQVGGLW FYLNPDHDGN FGVMQTGWQN IAGSWYHFDS SGAMETGWID DSGLWYYLRE
DGTMATGWLY CDGSWYYLKE DGSMARGWLS YGGAWYLFSS SGAMQVGWRQ DDGAWYFFSD
SGVMQTGWVD DVAAGKSYHL RSDGTWDGQS RAL