Gene Haur_0295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0295 
Symbol 
ID5732190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp348845 
End bp351409 
Gene Length2565 bp 
Protein Length854 aa 
Translation table11 
GC content54% 
IMG OID641277419 
Productglycoside hydrolase family protein 
Protein accessionYP_001543075 
Protein GI159896828 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.271703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGGC ACTATTACGC CCGAGGGGCG ATGCTGTTAG CGCTGCTAAC GATGATTGGT 
GGCCTGTTGA CCACCCAGAA TGCCAAGCCA ACCGCCGCCG CTGCCAGTTG TGTGGTCACC
TATCGCATTC CCAACGATTG GGGCAGTGGC TTCCTCGGCG ATGTTAATAT TCAAAATAAT
GGTGCAGCCA TCAGTAGCTG GACGGTTGGC TGGAGTTTCG CTGGCAATCA GCAAATTACC
AACCTCTGGA GTGGGATTGT AACCCAAACT GGCAACCAAG TAAGCGTGCG TAACGCAGGC
TGGAACGGCA CGATCAGCAG TGGTGGTGCA GTCAACTTTG GCTTCCAAGG AACCTACAGC
GGTGCTAATG CAATCCCAAC GGTGTTTACA TTAAATGGTG TGGTTTGTGG TGAAACGAAT
CCGAACCCAA CCGCAACCAC TCCACCAACC GCAACCACCC GCCCAACCAA CACGGTAGTT
GTACCAACCA ATACACCACG GGCAACTAAC ACAACCGTAC CACCAACTAA TACGGCTGTG
CCACCAACTA GCACAACTCG CCCAACTAAT ACGGCTGTGC CGCCAACCGC GACCAACGGC
CCAACCAGCA CGCCACGGCC CACCAATACC CCAACGGTTG TGCCACCAAC CAGCACCCCA
ACCCAACCAG GCGATGATAC CTACGATCAA CGCTTCTTGG AAATGTATGC TGAGTTGAAG
AACCCAGCTA ATGGCTATTT CAGCCCTGAA GGTGTGCCCT ACCACTCAAT CGAAACCTTG
ATTGTCGAAG CTCCGGATTA TGGCCACGAA ACCACTTCCG AAGCCTATAG CTATTGGTTG
TGGCTCGAAG CGATGTACGG TGAGGCAACT GGCAATTGGC AACCATTGGC CGATGCTTGG
CGCAACATGG AAATGTACAT CATTCCAACC AGCCAAGATC AACCAAGCAG TGGTTCGTAT
AATGCCAATA GCCCAGCCAC CTATGCTGGC GAATGGGAAT TACCCAGCCA ATATCCATCA
CAATTGCAAA CCAACGTTTC AGTTGGCCAA GACCCAATCG CCGCTGAATT GCGCTCGGCT
TATGGCACCA GCGATGTCTA TGGCATGCAC TGGTTGCTCG ACGTTGATAA CTGGTATGGC
TATGGCCGCC GTGGCGATGG CACCAGCAAA CCATCATATA TCAACACCTT CCAACGTGGC
GCTCAAGAAT CAGTTTGGGA AACCGTGCCC CATCCATCGT GGGAAAGTTT CAACGATGGC
GGCCCCTTCG GCTTCTTGAA CTTGTTCACT GGCGATGCGA GCTATGCCCG CCAATGGCGC
TACACCAACG CCCCCGATGC TGATGCCCGT GCTGTCCAAG CGATTTACTG GGCCAAAGTA
TGGGCCGACG AACAAGGTGG CTCACCAATC GTCAATGGTT TGGTAACCAA GGCCGCCAAG
ATGGGCGACT ACTTGCGCTA CGCCTTCTTC GATAAGTACT TCAAGCAAAT TGGCTGTACT
TCAACTTCAT GTCCAGCTGG TTCAGGCTAC AGCAGCGCTC ACTACCTATT GTCGTGGTAC
TACGCTTGGG GTGGTTCGAT TGGCAATGGT GGTGGCTGGG CATGGCGCAT CGGCAGCAGC
CACAATCACT TTGGCTACCA AAACCCAATG GCAGCCTGGA TTTTGGGCAG CCAACCAGCC
TTCAAGCCAG CTTCAACCAA CGGCGCTCGC GACTGGAACA CCAGCTTGAC CCGCCAAATC
GAGTTCTATA CTTGGTTGCA ATCAAGCGAA GGTGCCATCG CTGGTGGTGC TACCAATAGC
TGGAATGGCC GCTACGAAGC AGCCCCAGCC GGAACCAGCA CCTTCTACAA TATGGCCTAC
GACGAAAAAC CAGTCTATCA CGATCCCGCT AGCAACACCT GGTTTGGTTT CCAAGCTTGG
TCGATGGAAC GGGTCGCTGA ATATTACTAC GCTTCAGGCG ATGTCAAAGC CAAGAACGTG
CTCGATAAGT GGGTAACCTG GGCTTTGGCC AACACCACCT TGACCAGCAA TGGCAGCTAT
GAAATTCCAT CAACCTTGGC TTGGAGCGGC CAACCAGCTA CCTGGAACGC CAGCAATCCA
GCCGCTAACA CCAACTTGCA CGTCACGGTC GTCGATAAGA CCCAAGACGT TGGTGTTGCC
GCCGCCTATG CCAAAACCTT GATGTACTAC AGCGCTGCAA CCAAGCGCTA TGGCACTCAA
CATGTCGCTT CACAAACCAT GGCCAAAGAA TTGATCGACC GTATGTGGTC AGAATACCGC
GATGATAAGG GTGTTGCGAA CCCCGAAACC CGCCGCGATT ACAACCGCTT CGATGATCCA
GTGTCAGTGC CAAACGGCTG GACTGGCACC ATGGCCAATG GTGATCCAAT CAACAATTCA
TCAACCTTCT TGAGCATTCG CACCAAGTAC GAAGATGATC CAGCCTTCCC AGCCGTGCAA
GCCTACCTAA ACGGTGGCCC TGCTCCAACC TTCACCTACC ACCGCTTCTG GGCCCAGGCC
GATATCGCAA TGGCTTACGC CGAATACGAT CGCTTGTTCC AGTAA
 
Protein sequence
MSRHYYARGA MLLALLTMIG GLLTTQNAKP TAAAASCVVT YRIPNDWGSG FLGDVNIQNN 
GAAISSWTVG WSFAGNQQIT NLWSGIVTQT GNQVSVRNAG WNGTISSGGA VNFGFQGTYS
GANAIPTVFT LNGVVCGETN PNPTATTPPT ATTRPTNTVV VPTNTPRATN TTVPPTNTAV
PPTSTTRPTN TAVPPTATNG PTSTPRPTNT PTVVPPTSTP TQPGDDTYDQ RFLEMYAELK
NPANGYFSPE GVPYHSIETL IVEAPDYGHE TTSEAYSYWL WLEAMYGEAT GNWQPLADAW
RNMEMYIIPT SQDQPSSGSY NANSPATYAG EWELPSQYPS QLQTNVSVGQ DPIAAELRSA
YGTSDVYGMH WLLDVDNWYG YGRRGDGTSK PSYINTFQRG AQESVWETVP HPSWESFNDG
GPFGFLNLFT GDASYARQWR YTNAPDADAR AVQAIYWAKV WADEQGGSPI VNGLVTKAAK
MGDYLRYAFF DKYFKQIGCT STSCPAGSGY SSAHYLLSWY YAWGGSIGNG GGWAWRIGSS
HNHFGYQNPM AAWILGSQPA FKPASTNGAR DWNTSLTRQI EFYTWLQSSE GAIAGGATNS
WNGRYEAAPA GTSTFYNMAY DEKPVYHDPA SNTWFGFQAW SMERVAEYYY ASGDVKAKNV
LDKWVTWALA NTTLTSNGSY EIPSTLAWSG QPATWNASNP AANTNLHVTV VDKTQDVGVA
AAYAKTLMYY SAATKRYGTQ HVASQTMAKE LIDRMWSEYR DDKGVANPET RRDYNRFDDP
VSVPNGWTGT MANGDPINNS STFLSIRTKY EDDPAFPAVQ AYLNGGPAPT FTYHRFWAQA
DIAMAYAEYD RLFQ