Gene Haur_0296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0296 
Symbol 
ID5732191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp351585 
End bp354125 
Gene Length2541 bp 
Protein Length846 aa 
Translation table11 
GC content54% 
IMG OID641277420 
Productglycoside hydrolase family protein 
Protein accessionYP_001543076 
Protein GI159896829 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGTA CACCGCTCGA ACGTTCGAGC CGACGCTGGC GGATTATCGC CGGATTCGTT 
GCTTGTTTGC TGATTGCAGG GTTAATTATT AGCCCAACCA CCCCCACCAA AGCCGCCGAG
CCACTCTATC GCTATGGCGA GGCGCTGCAA AAGTCCTTTT TCTTCTACGA AGCTCAACAA
GCTGGGCCAA AACCAAGCTG GAATCGCGTT TCATGGCGCG GCGATTCGGT GCTTACCGAT
GGCGCTGATG TTGGTCTCAA TCTCAGCGGC GGCTGGTTCG ACGCAGGCGA TCACGTCAAA
TTTGGCTTTC CAATGGCGGC TTCGGCCACA ATGCTGGCGT GGGGCGCGGT CGAATATCGC
GATGCCTATG CCCAAAGCGG CCAACTCGAC GAATTGCTGA ACAATTTGCG CTTCGTCAAC
AACTACTTCA TCAATGCCCA TCCCTCGCCA AATGTGCTTT ATGGACAAGT TGGCAATGGC
GGCAAAGACC ACGCCTTCTG GGGACCAGCT GAAATTATTC ACCTCGACGA CCAAGCAGGC
CCACGACCAT CGTACAAAAT TGATGCAACT TGTGGTGGCT CAGATTTGGC AGGCGAAACC
GCCGCTGCCA TGGCTGCCTC GTCGATGGTC TTTCGCCCAA CCGACCCTGC TTATGCTGAT
ACGCTCCTAA GCCATGCTCG CCAACTCTAC ACGTTTGCCG ACACGGTGCG CGGCAAATAT
AGCGACTGTA TCACCGACGC TACCTCGTTC TACAACTCGT GGAGCGGTTA CAACGATGAG
TTGGTTTGGG GCGCAATTTG GCTCTATCGC GCTACGGGCG AAGCCAGCTA CCTGAGCAAG
GCCGAGCAAT ATTATGCCAA TCTCAGCACC GAACCCCAAA GCACAATCAA ATCGTATCGT
TGGAGCATCG CATGGGATGA TAAATCCTAT GGCTGTTATT TGTTGCTAGC CAAATTGACC
GGCAAACAAC AATACAAAGA CGATACCGAA CGCTGGTTGG ATTATTGGAC AGTCGGCTAT
AACGGCCAAC GTGTTACCTA TTCGCCAGGT GGCCTAGCAC AGTTGGATAC CTGGGGAGCC
TTGCGCTACT CGGCCAACAC CTCATTTGCC GCCTTTGTCT ACAGCGATTA CATCACCGAT
GCTACCAAAA AAGCTCGCTA CCACGACTTT GCGGTCAGCC AAATCAACTA TATGCTGGGC
AGCAATCCTC GCAACAGCAG CTATGTGGTT GGCTTCGGCA ATAATTCACC AGTCAATGTC
CACCATCGCA CCGCCCACGG CTCATGGACA GATTCATTGA GCAATCCAGT CAATCAACGC
CACATTTTAT ATGGGGCTTT GGTTGGCGGC CCAGCCAAAG GTTCGGGCGA TGCTTACACC
GATAGCCGCA ACGATTATGT GGCCAACGAA GTGGCGACCG ACTACAACGC AGGTTTTACT
AGCGCCTTGG CACGGATGTA TAGTGAATTT GGCGGCGCAC CACTCGCCAG CTTCCCACCA
ATCGAAACGC CTGAAGATGA ATTTTTCGTG GAAGCCAAAG TTAATGCTTC AGGCCCACGC
TTCATCGAAA TTAGCGGCGT ATTGCACAAC CAAAGTGCTT GGCCGGCCCG CAACAGCACC
AAACTCAGCT ATCGCTACTT TGTCGATTTG AGCGAAGTGT TTGCCGCTGG CTATGGCTTG
AGCGACGTTA CGGTTAGCAC AGCCTATACC CAAGGCTCAG GCGTTTCTAG CTTGAAGCAA
TGGGCTGGCA CAATTTACTA TGTCGAAATT GGCTTCAACG GAGTCAATGT CTACCCAGGT
GGTCAATCTG AATCACGCAA AGAAGTGCAA TTCCGACTTT CGTTGCCAAC CAACACCAAT
GCCCAACAAT GGGACAATAC CAATGACTGG TCGTTCAACG GCGTTGGCAC CAGCACCGAT
CGGGTCAAAA CCCGCCGGAT TCCGGTGTAT GACAATGGCG TGAAGGTCTT TGGCGATGAG
CCTGGTGGCA GCAACGTAAC CCCAACCGCA ACCAGCTTGC CAACCAACAC GGCTACGCCA
ACCGTGCGCC CAACCAACAC CGCAACCCCA ACCACGGGGC CAAGCGCAAC CCCAACTATT
CGCCCAACCA ACACGGCAAC CCCAACTGTT GGCCCAAGCG CAACCCCAAC CATCCGCCCG
ACCAATACAC CCACGGCCTT GCCAACAAAC ACACCGTTGC CAACGAACAC GCCAGTGGCT
GGGGCATGCC AAGTCAAATA TCGCGTTCCC AACGATTGGG GCAGCGGCTT CCTCGGCGAT
GTCACAATCA CCAACGGCGG CGCAGCGATC AATAGTTGGA ACTTGACTTG GAGCTTCGCA
GGCAGCCAAC AAATCACCAA CCTCTGGAGT GGGGTGGTGA GCCAAACCGG CCAAAACGTG
AGCGTCAGCA ACGCTGGCTG GAATGGGAGC CTTGCCAATG GTGGCTCCGT CAACTTCGGC
TTCCAAGCAA CCAACAACGG AACCAATAGC ATTCCTGCAA GCTTCAGCCT GAATGGGGCA
GCTTGTACGA TTGTGCCATA A
 
Protein sequence
MMSTPLERSS RRWRIIAGFV ACLLIAGLII SPTTPTKAAE PLYRYGEALQ KSFFFYEAQQ 
AGPKPSWNRV SWRGDSVLTD GADVGLNLSG GWFDAGDHVK FGFPMAASAT MLAWGAVEYR
DAYAQSGQLD ELLNNLRFVN NYFINAHPSP NVLYGQVGNG GKDHAFWGPA EIIHLDDQAG
PRPSYKIDAT CGGSDLAGET AAAMAASSMV FRPTDPAYAD TLLSHARQLY TFADTVRGKY
SDCITDATSF YNSWSGYNDE LVWGAIWLYR ATGEASYLSK AEQYYANLST EPQSTIKSYR
WSIAWDDKSY GCYLLLAKLT GKQQYKDDTE RWLDYWTVGY NGQRVTYSPG GLAQLDTWGA
LRYSANTSFA AFVYSDYITD ATKKARYHDF AVSQINYMLG SNPRNSSYVV GFGNNSPVNV
HHRTAHGSWT DSLSNPVNQR HILYGALVGG PAKGSGDAYT DSRNDYVANE VATDYNAGFT
SALARMYSEF GGAPLASFPP IETPEDEFFV EAKVNASGPR FIEISGVLHN QSAWPARNST
KLSYRYFVDL SEVFAAGYGL SDVTVSTAYT QGSGVSSLKQ WAGTIYYVEI GFNGVNVYPG
GQSESRKEVQ FRLSLPTNTN AQQWDNTNDW SFNGVGTSTD RVKTRRIPVY DNGVKVFGDE
PGGSNVTPTA TSLPTNTATP TVRPTNTATP TTGPSATPTI RPTNTATPTV GPSATPTIRP
TNTPTALPTN TPLPTNTPVA GACQVKYRVP NDWGSGFLGD VTITNGGAAI NSWNLTWSFA
GSQQITNLWS GVVSQTGQNV SVSNAGWNGS LANGGSVNFG FQATNNGTNS IPASFSLNGA
ACTIVP