Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1153 |
Symbol | |
ID | 5733046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1323927 |
End bp | 1325900 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278293 |
Product | alpha amylase catalytic region |
Protein accession | YP_001543929 |
Protein GI | 159897682 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00213284 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCAC TCACAGTTCA AACTATGCAT TTGCCCAATC CAACCACGTT CGATGATCTG CTCGATCAAC AGATTGCCAA TTCGCGTGAT CGCGATATTT TTCGCTTGCG CATGCAACGC CATTTTGGCG ATTGTTTAGA AGCGCTAGGA GCGTTGTATG CCCAGCATCC AGCTTGGCCA CAGTTGTTGG AGCAATTGCC CGAACGCTTG ATTACTGCCT ATGCCCAGCG CCGCGATGCC CTGAAAATTC ACGATTTAGC CCGCGAAATC CAGCCCGATT GGTTTGCTGA GGCCACCATG GTTGGCGGCA TTTACTATGT TGATCGCTTG GCAGGCACAT TGCGCGGGGT GATTGAGCAT ATTGATTATT TGCAAGAATT GGGTTTGACC TATGTGCATC TGATGCCGCT ATTACAGCCA CGCCATGGCC CCAACGATGG CGGCTATGCG GTGCTCGATT ATCGCTCGAT TGATCAACGG CTTGGCAATG TGGCCGATTT TATCGAATTA AGCGATTTGC TCCGTACCAA CGGCATCAGC TTATGCATTG ATGTGGTGGT GAATCACACG GCCAAAGAGC ATGAATGGGC AGTCAAGGCC CGTGCTGGTG ATGCCCAATA TTTGGATTAC TATCTGAGTT TTGCCGATCG CAGTTTGCCT GATGCCTATG AGCAACATTT ACCCGAAGTG TTTCCCGATT TTGCGCCTGG TAATTTTACT TGGTATGCCG AGTTGAGCGA GCATGGCCGT TGGGTTTGGA CGACCTTCAA CGAATTTCAA TGGGATTTGA ACTATACCAA CCCCATGGTT TGGCTGGAGA TGCTGGATAT TTTGCTGTAT CTCGCCAATC TAGGCGTTGA TGTGCTGCGT TTGGATGCCG TGCCGTTTAT GTGGAAACGC CTCGGCACGA ATTGCCAAAA TCAGCCCGAA GTGCTCGATT TGTTACAAGC TTGGCGAGCA GCCATGCGGA TCGTCTGTCC GGCGACAATT TTCAAGGCCG AGGCGATTGT TGCCCCCGAC GATTTGGTGC AATATTTGGG TTTGGGACGG CGCACAGGCA AGCTCTGTGA AATTGCCTAC CATAATTCGC TGATGGTGTT GTTGTGGAGT GCCTTGGCCT CGCAACGCGC CGATCTGTTT ACGCAATCGC TGTTGAACAT GCCTGCAACG CCCAGCAATG CCGCTTGGAT TACCTATGTG CGCTGCCACG ATGATATTGG CTGGGCTGTG ACCGACCACA ATGCAGCTTT GGTTGGCGAA GATGGGCCAT TGCATCGCCA ATTTTTAAGC GCTTGGTATA GTGGCGAATT TGCTGGTAGT TTTGCGCGGG GCGAGGTGTT TCAATATAAT CCACTCACCA ACGATCGCCG AATTAGCGGC ATGACTGCCT CGTTGGCTGG GCTAGAGCAA GCCTTGGAAA CCACCGATCC AGCAGCGATT GAATTGACAA TTCGCCGGAT TGCGTTGCTG TATGCCGTGA TTTTTAGCTT TGGTGGCATT CCGTTGATCT ATATGGGCGA TGAATTGGGC ATGCTCAATG ATCACAGCTA CTTGCATGAC CCTACCAAAG CCAACGATAA CCGCTGGTTG CATCGCCCAG CCATGGATTG GTGCTTAGCG GCCCAACGCC ATGATCCAAC TACGCTTGCT GGGCGCTTAT GGCAGGTATT GCGCCATTTG ATTCAGGTGC GCCAACATAC TCCAGCCTTG CATAGCGCAG GCCAAACCTT GCCAATCTGG ACACAGCAAC GCCATGTTTT AGGGGTGGTT CGAGTTCACC CATTGGGGCG AATTTTAATT CTTGGAAACC TTTCCGCCAC CCCACAGCGG GTCAGTTTAG CGGTTATTCA ACAAGCAGGG CTGGTTGGTC GCTTATATAA TTTGTTGGAT AACGATTCAC TTAATATCGA TACACAAAGC CATGAAATTA TACTCGATGC ATATCAATGT TGTTGGCTCA GCATTCAAGC CTAA
|
Protein sequence | MQPLTVQTMH LPNPTTFDDL LDQQIANSRD RDIFRLRMQR HFGDCLEALG ALYAQHPAWP QLLEQLPERL ITAYAQRRDA LKIHDLAREI QPDWFAEATM VGGIYYVDRL AGTLRGVIEH IDYLQELGLT YVHLMPLLQP RHGPNDGGYA VLDYRSIDQR LGNVADFIEL SDLLRTNGIS LCIDVVVNHT AKEHEWAVKA RAGDAQYLDY YLSFADRSLP DAYEQHLPEV FPDFAPGNFT WYAELSEHGR WVWTTFNEFQ WDLNYTNPMV WLEMLDILLY LANLGVDVLR LDAVPFMWKR LGTNCQNQPE VLDLLQAWRA AMRIVCPATI FKAEAIVAPD DLVQYLGLGR RTGKLCEIAY HNSLMVLLWS ALASQRADLF TQSLLNMPAT PSNAAWITYV RCHDDIGWAV TDHNAALVGE DGPLHRQFLS AWYSGEFAGS FARGEVFQYN PLTNDRRISG MTASLAGLEQ ALETTDPAAI ELTIRRIALL YAVIFSFGGI PLIYMGDELG MLNDHSYLHD PTKANDNRWL HRPAMDWCLA AQRHDPTTLA GRLWQVLRHL IQVRQHTPAL HSAGQTLPIW TQQRHVLGVV RVHPLGRILI LGNLSATPQR VSLAVIQQAG LVGRLYNLLD NDSLNIDTQS HEIILDAYQC CWLSIQA
|
| |