Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2136 |
Symbol | |
ID | 5734038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2684002 |
End bp | 2687388 |
Gene Length | 3387 bp |
Protein Length | 1128 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279277 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001544904 |
Protein GI | 159898657 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGGA TCTTTAATTC CCGTGGATTA ATTCTTGTTC TTTTGACCGC CGTGGTTGGC AGTGGCCTAG GCCAATTGAA TGCCAAACCA ACCGCCGCAG CTACCAGCTG CGAAGTCGTC TATAACATTG CTAACGATTG GGGTAGCGGT TTTATTGGCG ATGTGACGCT TAAAAACACT GGTTCAGCCG TTTCTAGCTG GACGCTCGGC TGGTCGTTTG CTGGCAACCA AAACATCTCA AATCTTTGGG GTGGGGTTGT CGCGCAAAAC GGCGCGAATG TGAGCGTTAG CAATGCAGGC TGGAATGGCA ACATCGGCAC TGGCGGCACG GTAAACTTCG GTTTCCAAGC TGGCTATAGT GGCGCAAATG CTAAGCCAAC CGTCTTTACC CTCAATGGTG TAACTTGTGG CGGCACGGTT GTTCCAACCA CACCACCAAC CAACACGACA GTTCCAACCA CACCGCCAAC CAATACACCG GTTCCAAGCG CAACCACGCG CCCAACCAAC ACGACGGTTC CAACTACACC ACCAACTAGC ACAACTGGCC CAACGAATGT TCCAACGAAC ACGCCAGTTC CAAGCGCAAC CACGCGCCCA ACCAACACGA CAGTTCCGAC TACTCCACCA ACCAGCACGA CTGGCCCAAC CAATACGCCA GCACCAACAA CTGTACCAGG TGTGCACGTT GCTAACCCAT TTGTGGGCGC ACAAGGCTAC ATCAACAGTG AATATGCTGC ACGGGTGAAC GCCGAAGCCA ATGCCACTGG TGGCACGCTT GGTTCACAAA TGCGTCAAGT GGCCTCGTAT CCAACTGCTG TATGGCTCGA CCGAATTGCC GCAATCGCTG GCAGCAGCGA ATCGATGGGC TTACGCGCTC ACCTTGATGC TGCCTTGGTG CAACAACAAA CCAGCGGTCA ACAAGTGGCA ATCTCAATTG TGGTGTACGA CTTGCCCAAC CGCGACTGTG CCGCCTTGGC TTCGAACGGT GAGTTGAAAA TCTCGGAAAA TGGCTTGAAT CGCTACAAAA CCGAGTATAT CGACCCGATT GCCGCTATCG TGGGCGAGAG CAAATATGCC TCATTACGGA TTGTGGTGAT TTTGGAACCA GATTCACTTT CGAACTTGGT GACCAACGCT AGCATTCCCG CCTGTGCTGA AGCGATTTCG AGCGGAGCCT ACGTCCAAGG TGTTCAATAT GCGATCAACA AACTGAATGT GACCAGCAAC GTCTACATCT ACATGGACAT TGCCCACTCA GGTTGGTTGG GCTGGGACAG CAACTTCACG CCCGCAATTC AGCTGTACAC CCAAACGGTG CGCGGCACGA CCAAAGGCTT GAACGGGATC GATGGCTTTA TCTCGAACAC CGCCAACTAC ACACCATTGA ACGAAGTGTT CTTGCCAAAC TCGGGCTTGA CCTTGGGTGG TGGCAACCCG ATTCGCTCGT CGTTGTTCTA CGAATGGAAC CCGTACTTCG ACGAAACCGA CTATGTGTTG GCGATGCGCA ATGCCTTCAT CACCGCCGGA TTCCCCAGCG GCATCGGGAT GTTGATTGAT ACCAGCCGCA ACGGTTGGGG TGGCACGGCA CGCCCAACGA TGGTGAGCAG CTCGAATTCA TTGGAAATCT ATGTCAACGA TTCGAAGCTG GATCGTCGCC CACATCGTGG TGGCTGGTGT AACCAAGCCG GGGCAGGGAT TGGCGAACGG CCAACGGCAG CCCCCGTGAG CGGGATTGAT GCCTATGTGT GGGTCAAACC GCCAGGCGAA TCGGACGGTG TGGCGACGGC AGGGGTGATT GACCCAACCG ACCCAGCCAA GCAGTTTGAT GCGATGTGTG ACCCGAACGC GCAAAACCGC TACAACACAG CATACCCAAC CAATGCATTG GCGGGAGCAC CGCACGCAGG TCGCTGGTTC CCATCGCAAT TTGCAATGTT GGTTCGCAAT GCCTATCCAC CAATTTCGCA ATCGACCAAC CCAACAACTA CTCCGGTGGC AACGACGGTT CCACCAACCA GCACGCCAGT TGGCACACCA GCGCCAACCA GCACGCCAGC ACCAACTAGC ACACCAGCGC CAACGAGTGT CCCAGGTGTA CATGTTGCTA ACCCATTTGT GGGCGCACAA GGCTACATCA ACAGTGAATA TGCTGCACGG GTGAACGCCG AAGCCAATGC CACTGGTGGC ACGCTTGGTT CACAAATGCT CAAGGTTGCT TCATATCCAA CCGCTGTGTG GCTCGATCGA ATTGCCGCAA TCGCTGGCAG CAGCGACTCG ATGGGCTTAC GTGCTCACCT TGATGCCGCA CTGGTGCAAC AACAAACCAG TGGTCAAGTG GCAATCTCGA TTGTGGTCTA CGACTTGCCC AACCGCGACT GTGCCGCCTT GGCCTCGAAC GGCGAGTTGA AAATCTCGGA AAATGGCTTG AATCGCTATA AAACCGAGTA TATCGACCCG ATTGCCGCTA TCGTGGGCGA GAGCAAATAT GCCTCATTAC GGATTGTGGT GATTTTGGAA CCAGATTCAC TTTCGAACTT GGTGACCAAC GCTAGCATTC CCGCCTGTGC TGAGGCGATT TCAAGCGGAG CCTATGTCCA AGGTGTTCAA TATGCCATCA ACAAACTGAA TGTGACCAGC AATGTCTACA TCTACATGGA CATTGCCCAC TCAGGTTGGT TGGGCTGGGA CAGCAACTTC ACGCCCGCGA TTCAGCTGTA CACCCAAACG GTGCGCGGCA CGACCAAAGG CTTGAACGGG ATCGATGGCT TTATCTCGAA CACGGCCAAC TACACGCCAT TGAACGAAAT CTTCTTGCCA AACTCGGGGT TGACCTTAGG TGGTGGCAAC CCGATTCGCT CGTCGTTGTT CTACGAATGG AACCCGTACT TCGACGAAAC CGACTATGTG TTGGCGATGC GCAATGCCTT CATCACCGCC GGATTCCCCA GCGGCATCGG GATGTTGATT GATACCAGCC GCAACGGTTG GGGTGGCACG GCACGCCCAA CGATGGTGAG CAGCTCGAAT TCATTGGAAA TCTATGTCAA CGATTCGAAG CTGGATCGTC GCCCACATCG TGGTGGCTGG TGTAACCAAG CCGGGGCAGG GATTGGCGAA CGGCCAACAG CAGCCCCCGT GAGCGGGATT GATGCCTATG TGTGGGTCAA ACCGCCAGGC GAATCAGACG GTGTGGCGAC GGCAGGGGTG ATTGACCCAA CCGACCCAGC CAAGCAGTTT GATGCGATGT GTGACCCGAA CGCGCAAAAC CGCTACAACA CAGCATACCC AACCAATGCA TTGGCGGGAG CACCGCACGC AGGTCGCTGG TTCCCATCGC AATTTGCAAT GTTGGTTCGC AACGCCTATC CTGCAATTGC ACCGTAG
|
Protein sequence | MARIFNSRGL ILVLLTAVVG SGLGQLNAKP TAAATSCEVV YNIANDWGSG FIGDVTLKNT GSAVSSWTLG WSFAGNQNIS NLWGGVVAQN GANVSVSNAG WNGNIGTGGT VNFGFQAGYS GANAKPTVFT LNGVTCGGTV VPTTPPTNTT VPTTPPTNTP VPSATTRPTN TTVPTTPPTS TTGPTNVPTN TPVPSATTRP TNTTVPTTPP TSTTGPTNTP APTTVPGVHV ANPFVGAQGY INSEYAARVN AEANATGGTL GSQMRQVASY PTAVWLDRIA AIAGSSESMG LRAHLDAALV QQQTSGQQVA ISIVVYDLPN RDCAALASNG ELKISENGLN RYKTEYIDPI AAIVGESKYA SLRIVVILEP DSLSNLVTNA SIPACAEAIS SGAYVQGVQY AINKLNVTSN VYIYMDIAHS GWLGWDSNFT PAIQLYTQTV RGTTKGLNGI DGFISNTANY TPLNEVFLPN SGLTLGGGNP IRSSLFYEWN PYFDETDYVL AMRNAFITAG FPSGIGMLID TSRNGWGGTA RPTMVSSSNS LEIYVNDSKL DRRPHRGGWC NQAGAGIGER PTAAPVSGID AYVWVKPPGE SDGVATAGVI DPTDPAKQFD AMCDPNAQNR YNTAYPTNAL AGAPHAGRWF PSQFAMLVRN AYPPISQSTN PTTTPVATTV PPTSTPVGTP APTSTPAPTS TPAPTSVPGV HVANPFVGAQ GYINSEYAAR VNAEANATGG TLGSQMLKVA SYPTAVWLDR IAAIAGSSDS MGLRAHLDAA LVQQQTSGQV AISIVVYDLP NRDCAALASN GELKISENGL NRYKTEYIDP IAAIVGESKY ASLRIVVILE PDSLSNLVTN ASIPACAEAI SSGAYVQGVQ YAINKLNVTS NVYIYMDIAH SGWLGWDSNF TPAIQLYTQT VRGTTKGLNG IDGFISNTAN YTPLNEIFLP NSGLTLGGGN PIRSSLFYEW NPYFDETDYV LAMRNAFITA GFPSGIGMLI DTSRNGWGGT ARPTMVSSSN SLEIYVNDSK LDRRPHRGGW CNQAGAGIGE RPTAAPVSGI DAYVWVKPPG ESDGVATAGV IDPTDPAKQF DAMCDPNAQN RYNTAYPTNA LAGAPHAGRW FPSQFAMLVR NAYPAIAP
|
| |