Gene Information |
Locus tag | Haur_1418 |
Symbol | |
ID | 5733326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1635363 |
End bp | 1638146 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278556 |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_001544190 |
Protein GI | 159897943 |
COG category | |
COG ID | |
TIGRFAM ID | |
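
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1635363
end_bp = 1638146

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends
# with a stop codon that does not encode an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the reported Gene Length (2784 bp) and Protein Length (927 aa).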
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCGCTG ACCACAGCCG TACCGTGATT GGTATCCGTT CGACGAGGAG CTTTTGGGCA CGAGGATTTT TCGTCTTTTG GTTGATTATT GCTTTAGGCT GCCAACAAGC GATTGTTCCA GCCACCTCTC AGATTTATGC CCAACCCCCG CAAACTGTTG GTAATTTGCT GCAAAATGGC GATTTCAGCG CTGGTTTTGC GCCATGGTGG GCCACAAGCT CGGTTGCTAC TGATACTGCG AGTGGCGCAT TAGTCGCAAC CATCAATAAT GCTGGTAGTA ATCCATGGGA TGCAATCGTT GGCCAAAATG GAGTTACGCT TCAATCAGGC CAAACCTACA CCGTCACGTT TCGCATCCGT GCCTCAACCA CTGGTACTGT GGTGATGAAA CTGCAAAAAG AGGCCTCTCC CTATACCAAC TATTTTAGCC AAGACGTGGC TTTGACTACT AGCGATCAAG CCCATCAATT TGTCTTTACC TCAGGCTTTG ATGATGCTGG GGCTGCTTTT CAATTTCAAA TGGGCGGCCA AGGCACGAAT GTACTGACAA TTGACGATGT GCAAGTGTTG GGCGAAACTG GCCCGGTCGA GCCATCGGGC AATTTGGTGC AAAATGGCGA GTTTGTTGGT GGCCTACCGC CTTGGTGGAC TGGCGGCGAT GTCAGTGTGA ACACCGATGA TGGCGCGTGT TTGACGATCA ACACTCCTGG CACAAATCCT TGGGATGTGC AGCTTGGTCA ACATACCATT GCGATTGAAG CTGGTGTGAG CTATCAGCTT AAATTTGCTG CCAAATCAAC TGTGCCAGTC ACTTTGCCCG TGCGTTTGCA AAAAAATGCC GAGCCATACA CTGGCTATTT TTCGGCTGAT CCAGTGCTCA ACTCGGCCTG GACAGAATTT GTCTATAATT TTACATCGGC CTATAGCGAT GCAGCTTCGT TGTTGTTCCA AATGGGTGGC ATCGGAACCC CGACGATCTG TATTGATCAG GTGAGTTTGT ATATGTTAGA AACCGGAATT CGGGTCAACC AAGCCAGCTA TTTGCCCACG ATGAGCAAAG TGGCAAGTTT GGTGCATCCT GCGACTGAGC CATTGGCTTG GCAATTGCAT AACACTGCTG ATGCAGTGGT GGCAAGCGGC CAAACCACGG TGTATGGCGA AGAATCAGCC TCGGCTGAAC ATGTGCATTT GATCGATTTT TCAAGCTACC AAACGGTTGG TGAAGGCTAT TATCTGAGCA TCGGCAGCGA AACGAGCTAT CCATTTGCCA TCGAGGCAGG TTTGTATTCG CGGATGAAAT ATGATGCTTT GGCCTATTTC TACCATAATC GCAGCGGGAT TGCGATTACC ATGCCCTATG CTGGTGGCGA GCAATGGACG CGACCAGCTG GCCACATCGG GGTTGCACCC AACCAAGGCG ATACCTCGGT AACCTGTTTT ACTGGCACTG ACACCCAAGG TCAAAGCTGG CCCGGGTGCG ATTATCAACT TGACGTTTCA AAGGGTTGGT ACGACGCTGG CGACCATGGC AAATATGTGG TTAACAGTGG TATTTCGGTC TGGACGTTGC TTAATCAATA TGAACGAGCG CAGCAACGTG GCCCTGCCAG CCTAGCCCAA TTTGCCGATG GCACGATGAA TATTCCCGAA AATAACAATG GCGTGCCCGA TTTGTTGGAT GAAGTGCGCT GGAATATGGA GTTTATGCTA GGAATGCAAA TTCCTACGAC TGCGCCAGTT TCCAAAACGG GTATGGTGCA CCATAAAGTT CACGATGCGA ACTGGACTGG TTTGCCAATG 
GCTCCACACG AAGATAGCCA AATGCGCTAT TTGTATCCAC CAAGTACCGC CGCAACCTTG AACTTGGCTG CGACCGCTGC CCAATGTTCA CGGATTTGGC GTGAGATCGA TGAAGCCTTT GCTGATCGAT GTTTGGTGGC GGCTGAACGC GCTTGGCAAG CAGCCTTGGC CTACCCTAAC GAAATTGCCC GCGATAACTT CAATGGCGGT GGTGGCTATG GCGATAGCAC CTTGAACGAT GAATGGTATT GGGCCGCCGC CGAATTGTAT ATCACAACTG GCTCGGCCAC CTATCGCGAA GCAATTGAAG AATCGAGCTA TTACTTGCGG CTTGATCTTG GTGGTGGAAG CGCCATGAAC TGGGGCGGCG TGGCAAGCCT TGGCACCTTG TCGTTGGCCT TGGTTCCCAG TGATTTGAGT AGCGCGAATC GCCAAACTGC CCGTGCAGCA GTGATTGCAG CGGCTGATCA ATTTGTGGCA GCGCAGCAAT CCAGCGGTTA TGGCATTCCA TACAATCCTG GTGCGCAGTA TCCTTGGGGT TCCAACTCAT CAATTCTGAA TAATATGATT GTAATGGGTT TGGCAGGCGA CTTTACTGGA AATGCCAACT ATGCCGATGC AATTAGCCAA GGCATGGATT ACTTGCTGGG GCGTAATCCA CTCAATCGCT CGTATATTTC GGGCTATGGC TCGGTCTCAT TGACCAATCC ACATCATCGC TTCTGGGCTA AGCAAATTAA TCCAGAGTAT CCTGGTACGC CGCCAGGTGT GGTGGCTGGG GGGCCAAATT CAAGCATTCA AGACCCTTAT GCTCAAGTTG AATTAGCTGG TTGTGCGGCC TTGAAATGCT ATGTTGATCA TATCGATTCG TGGTCAACCA ATGAAGTGAC GATTAACTGG AACTCGCCGC TGGCATGGGT TGCGGCCTAT CTTGATGATT ATGATACAAG TTCGGTGCAG TATCTGCCGT TGATCAGTAA GTAA
Protein sequence | MSADHSRTVI GIRSTRSFWA RGFFVFWLII ALGCQQAIVP ATSQIYAQPP QTVGNLLQNG DFSAGFAPWW ATSSVATDTA SGALVATINN AGSNPWDAIV GQNGVTLQSG QTYTVTFRIR ASTTGTVVMK LQKEASPYTN YFSQDVALTT SDQAHQFVFT SGFDDAGAAF QFQMGGQGTN VLTIDDVQVL GETGPVEPSG NLVQNGEFVG GLPPWWTGGD VSVNTDDGAC LTINTPGTNP WDVQLGQHTI AIEAGVSYQL KFAAKSTVPV TLPVRLQKNA EPYTGYFSAD PVLNSAWTEF VYNFTSAYSD AASLLFQMGG IGTPTICIDQ VSLYMLETGI RVNQASYLPT MSKVASLVHP ATEPLAWQLH NTADAVVASG QTTVYGEESA SAEHVHLIDF SSYQTVGEGY YLSIGSETSY PFAIEAGLYS RMKYDALAYF YHNRSGIAIT MPYAGGEQWT RPAGHIGVAP NQGDTSVTCF TGTDTQGQSW PGCDYQLDVS KGWYDAGDHG KYVVNSGISV WTLLNQYERA QQRGPASLAQ FADGTMNIPE NNNGVPDLLD EVRWNMEFML GMQIPTTAPV SKTGMVHHKV HDANWTGLPM APHEDSQMRY LYPPSTAATL NLAATAAQCS RIWREIDEAF ADRCLVAAER AWQAALAYPN EIARDNFNGG GGYGDSTLND EWYWAAAELY ITTGSATYRE AIEESSYYLR LDLGGGSAMN WGGVASLGTL SLALVPSDLS SANRQTARAA VIAAADQFVA AQQSSGYGIP YNPGAQYPWG SNSSILNNMI VMGLAGDFTG NANYADAISQ GMDYLLGRNP LNRSYISGYG SVSLTNPHHR FWAKQINPEY PGTPPGVVAG GPNSSIQDPY AQVELAGCAA LKCYVDHIDS WSTNEVTINW NSPLAWVAAY LDDYDTSSVQ YLPLISK
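
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a simple GC-fraction helper and a start/stop codon check. The helper names (`clean_sequence`, `gc_fraction`) are hypothetical, and only the first two and last two blocks of the gene sequence are used here for brevity.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two and last two blocks of the gene sequence shown above.
first_blocks = "ATGAGCGCTG ACCACAGCCG"
last_blocks = "TGATCAGTAA GTAA"

head = clean_sequence(first_blocks)
tail = clean_sequence(last_blocks)

# A bacterial CDS commonly starts with ATG; translation table 11 uses
# TAA, TAG, and TGA as stop codons.
starts_with_atg = head.startswith("ATG")
ends_with_stop = tail[-3:] in {"TAA", "TAG", "TGA"}

# GC fraction of the 20-base excerpt only, not of the whole gene.
head_gc = gc_fraction(head)

print(starts_with_atg, ends_with_stop, head_gc)
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field in the record; the excerpt alone is too short to be representative.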