Gene Haur_2136 details


Gene Information

Locus tag: Haur_2136
Symbol:
ID: 5734038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2684002
End bp: 2687388
Gene Length: 3387 bp
Protein Length: 1128 aa
Translation table: 11
GC content: 55%
IMG OID: 641279277
Product: glycoside hydrolase family protein
Protein accession: YP_001544904
Protein GI: 159898657
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
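
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS is complete and ends in a single stop codon. Under those assumptions the reported gene length and protein length are mutually consistent.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2684002
END_BP = 2687388
GENE_LENGTH_BP = 3387
PROTEIN_LENGTH_AA = 1128


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acid count, assuming a complete CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```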


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCGGA TCTTTAATTC CCGTGGATTA ATTCTTGTTC TTTTGACCGC CGTGGTTGGC 
AGTGGCCTAG GCCAATTGAA TGCCAAACCA ACCGCCGCAG CTACCAGCTG CGAAGTCGTC
TATAACATTG CTAACGATTG GGGTAGCGGT TTTATTGGCG ATGTGACGCT TAAAAACACT
GGTTCAGCCG TTTCTAGCTG GACGCTCGGC TGGTCGTTTG CTGGCAACCA AAACATCTCA
AATCTTTGGG GTGGGGTTGT CGCGCAAAAC GGCGCGAATG TGAGCGTTAG CAATGCAGGC
TGGAATGGCA ACATCGGCAC TGGCGGCACG GTAAACTTCG GTTTCCAAGC TGGCTATAGT
GGCGCAAATG CTAAGCCAAC CGTCTTTACC CTCAATGGTG TAACTTGTGG CGGCACGGTT
GTTCCAACCA CACCACCAAC CAACACGACA GTTCCAACCA CACCGCCAAC CAATACACCG
GTTCCAAGCG CAACCACGCG CCCAACCAAC ACGACGGTTC CAACTACACC ACCAACTAGC
ACAACTGGCC CAACGAATGT TCCAACGAAC ACGCCAGTTC CAAGCGCAAC CACGCGCCCA
ACCAACACGA CAGTTCCGAC TACTCCACCA ACCAGCACGA CTGGCCCAAC CAATACGCCA
GCACCAACAA CTGTACCAGG TGTGCACGTT GCTAACCCAT TTGTGGGCGC ACAAGGCTAC
ATCAACAGTG AATATGCTGC ACGGGTGAAC GCCGAAGCCA ATGCCACTGG TGGCACGCTT
GGTTCACAAA TGCGTCAAGT GGCCTCGTAT CCAACTGCTG TATGGCTCGA CCGAATTGCC
GCAATCGCTG GCAGCAGCGA ATCGATGGGC TTACGCGCTC ACCTTGATGC TGCCTTGGTG
CAACAACAAA CCAGCGGTCA ACAAGTGGCA ATCTCAATTG TGGTGTACGA CTTGCCCAAC
CGCGACTGTG CCGCCTTGGC TTCGAACGGT GAGTTGAAAA TCTCGGAAAA TGGCTTGAAT
CGCTACAAAA CCGAGTATAT CGACCCGATT GCCGCTATCG TGGGCGAGAG CAAATATGCC
TCATTACGGA TTGTGGTGAT TTTGGAACCA GATTCACTTT CGAACTTGGT GACCAACGCT
AGCATTCCCG CCTGTGCTGA AGCGATTTCG AGCGGAGCCT ACGTCCAAGG TGTTCAATAT
GCGATCAACA AACTGAATGT GACCAGCAAC GTCTACATCT ACATGGACAT TGCCCACTCA
GGTTGGTTGG GCTGGGACAG CAACTTCACG CCCGCAATTC AGCTGTACAC CCAAACGGTG
CGCGGCACGA CCAAAGGCTT GAACGGGATC GATGGCTTTA TCTCGAACAC CGCCAACTAC
ACACCATTGA ACGAAGTGTT CTTGCCAAAC TCGGGCTTGA CCTTGGGTGG TGGCAACCCG
ATTCGCTCGT CGTTGTTCTA CGAATGGAAC CCGTACTTCG ACGAAACCGA CTATGTGTTG
GCGATGCGCA ATGCCTTCAT CACCGCCGGA TTCCCCAGCG GCATCGGGAT GTTGATTGAT
ACCAGCCGCA ACGGTTGGGG TGGCACGGCA CGCCCAACGA TGGTGAGCAG CTCGAATTCA
TTGGAAATCT ATGTCAACGA TTCGAAGCTG GATCGTCGCC CACATCGTGG TGGCTGGTGT
AACCAAGCCG GGGCAGGGAT TGGCGAACGG CCAACGGCAG CCCCCGTGAG CGGGATTGAT
GCCTATGTGT GGGTCAAACC GCCAGGCGAA TCGGACGGTG TGGCGACGGC AGGGGTGATT
GACCCAACCG ACCCAGCCAA GCAGTTTGAT GCGATGTGTG ACCCGAACGC GCAAAACCGC
TACAACACAG CATACCCAAC CAATGCATTG GCGGGAGCAC CGCACGCAGG TCGCTGGTTC
CCATCGCAAT TTGCAATGTT GGTTCGCAAT GCCTATCCAC CAATTTCGCA ATCGACCAAC
CCAACAACTA CTCCGGTGGC AACGACGGTT CCACCAACCA GCACGCCAGT TGGCACACCA
GCGCCAACCA GCACGCCAGC ACCAACTAGC ACACCAGCGC CAACGAGTGT CCCAGGTGTA
CATGTTGCTA ACCCATTTGT GGGCGCACAA GGCTACATCA ACAGTGAATA TGCTGCACGG
GTGAACGCCG AAGCCAATGC CACTGGTGGC ACGCTTGGTT CACAAATGCT CAAGGTTGCT
TCATATCCAA CCGCTGTGTG GCTCGATCGA ATTGCCGCAA TCGCTGGCAG CAGCGACTCG
ATGGGCTTAC GTGCTCACCT TGATGCCGCA CTGGTGCAAC AACAAACCAG TGGTCAAGTG
GCAATCTCGA TTGTGGTCTA CGACTTGCCC AACCGCGACT GTGCCGCCTT GGCCTCGAAC
GGCGAGTTGA AAATCTCGGA AAATGGCTTG AATCGCTATA AAACCGAGTA TATCGACCCG
ATTGCCGCTA TCGTGGGCGA GAGCAAATAT GCCTCATTAC GGATTGTGGT GATTTTGGAA
CCAGATTCAC TTTCGAACTT GGTGACCAAC GCTAGCATTC CCGCCTGTGC TGAGGCGATT
TCAAGCGGAG CCTATGTCCA AGGTGTTCAA TATGCCATCA ACAAACTGAA TGTGACCAGC
AATGTCTACA TCTACATGGA CATTGCCCAC TCAGGTTGGT TGGGCTGGGA CAGCAACTTC
ACGCCCGCGA TTCAGCTGTA CACCCAAACG GTGCGCGGCA CGACCAAAGG CTTGAACGGG
ATCGATGGCT TTATCTCGAA CACGGCCAAC TACACGCCAT TGAACGAAAT CTTCTTGCCA
AACTCGGGGT TGACCTTAGG TGGTGGCAAC CCGATTCGCT CGTCGTTGTT CTACGAATGG
AACCCGTACT TCGACGAAAC CGACTATGTG TTGGCGATGC GCAATGCCTT CATCACCGCC
GGATTCCCCA GCGGCATCGG GATGTTGATT GATACCAGCC GCAACGGTTG GGGTGGCACG
GCACGCCCAA CGATGGTGAG CAGCTCGAAT TCATTGGAAA TCTATGTCAA CGATTCGAAG
CTGGATCGTC GCCCACATCG TGGTGGCTGG TGTAACCAAG CCGGGGCAGG GATTGGCGAA
CGGCCAACAG CAGCCCCCGT GAGCGGGATT GATGCCTATG TGTGGGTCAA ACCGCCAGGC
GAATCAGACG GTGTGGCGAC GGCAGGGGTG ATTGACCCAA CCGACCCAGC CAAGCAGTTT
GATGCGATGT GTGACCCGAA CGCGCAAAAC CGCTACAACA CAGCATACCC AACCAATGCA
TTGGCGGGAG CACCGCACGC AGGTCGCTGG TTCCCATCGC AATTTGCAAT GTTGGTTCGC
AACGCCTATC CTGCAATTGC ACCGTAG
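
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used as demo input; pasting the full sequence works the same way.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a small demo input.
first_line = "ATGGCTCGGA TCTTTAATTC CCGTGGATTA ATTCTTGTTC TTTTGACCGC CGTGGTTGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```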
 
Protein sequence
MARIFNSRGL ILVLLTAVVG SGLGQLNAKP TAAATSCEVV YNIANDWGSG FIGDVTLKNT 
GSAVSSWTLG WSFAGNQNIS NLWGGVVAQN GANVSVSNAG WNGNIGTGGT VNFGFQAGYS
GANAKPTVFT LNGVTCGGTV VPTTPPTNTT VPTTPPTNTP VPSATTRPTN TTVPTTPPTS
TTGPTNVPTN TPVPSATTRP TNTTVPTTPP TSTTGPTNTP APTTVPGVHV ANPFVGAQGY
INSEYAARVN AEANATGGTL GSQMRQVASY PTAVWLDRIA AIAGSSESMG LRAHLDAALV
QQQTSGQQVA ISIVVYDLPN RDCAALASNG ELKISENGLN RYKTEYIDPI AAIVGESKYA
SLRIVVILEP DSLSNLVTNA SIPACAEAIS SGAYVQGVQY AINKLNVTSN VYIYMDIAHS
GWLGWDSNFT PAIQLYTQTV RGTTKGLNGI DGFISNTANY TPLNEVFLPN SGLTLGGGNP
IRSSLFYEWN PYFDETDYVL AMRNAFITAG FPSGIGMLID TSRNGWGGTA RPTMVSSSNS
LEIYVNDSKL DRRPHRGGWC NQAGAGIGER PTAAPVSGID AYVWVKPPGE SDGVATAGVI
DPTDPAKQFD AMCDPNAQNR YNTAYPTNAL AGAPHAGRWF PSQFAMLVRN AYPPISQSTN
PTTTPVATTV PPTSTPVGTP APTSTPAPTS TPAPTSVPGV HVANPFVGAQ GYINSEYAAR
VNAEANATGG TLGSQMLKVA SYPTAVWLDR IAAIAGSSDS MGLRAHLDAA LVQQQTSGQV
AISIVVYDLP NRDCAALASN GELKISENGL NRYKTEYIDP IAAIVGESKY ASLRIVVILE
PDSLSNLVTN ASIPACAEAI SSGAYVQGVQ YAINKLNVTS NVYIYMDIAH SGWLGWDSNF
TPAIQLYTQT VRGTTKGLNG IDGFISNTAN YTPLNEIFLP NSGLTLGGGN PIRSSLFYEW
NPYFDETDYV LAMRNAFITA GFPSGIGMLI DTSRNGWGGT ARPTMVSSSN SLEIYVNDSK
LDRRPHRGGW CNQAGAGIGE RPTAAPVSGI DAYVWVKPPG ESDGVATAGV IDPTDPAKQF
DAMCDPNAQN RYNTAYPTNA LAGAPHAGRW FPSQFAMLVR NAYPAIAP
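
The protein sequence can be checked against the gene sequence by translating the CDS. The sketch below is a minimal illustration, not the pipeline used to produce this record. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled, which is sufficient here because this gene begins with ATG. The function and constant names are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the code table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to the first stop codon; unknown codons become X."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGCTCGGA TCTTTAATTC CCGTGGATTA"))
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence, after removing the spaces from both, is a quick way to confirm that the two blocks belong together.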