Gene Information |
Locus tag | Haur_2435 |
Symbol | |
ID | 5734316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3118970 |
End bp | 3120325 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279576 |
Product | alpha amylase catalytic region |
Protein accession | YP_001545203 |
Protein GI | 159898956 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
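The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 3118970
END_BP = 3120325


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1356
print(protein_length(length_bp))  # 451
```

Under these assumptions the computed values agree with the Gene Length (1356 bp) and Protein Length (451 aa) fields in the table.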
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCCAT GGGCCAACGA CGTACAATGT TATCAAATTT ATCCCTTAGG GCTTTGCGGC GCACCAATTC GCAATGATCA AACCAGCGAA CCGATTGCGC GGCTCAAGCA ATTGCACACA TGGATTGAGC ACTTACAACA TTTAGGCAGC AACTTGCTCT ACCTTGGGCC AGTGTTTGAA TCGACAGCTC ATGGCTACGA CACAATTGAT TATTTCACGG TTGATCGGCG GCTAGGTAGC AACAACGATT TACAACAGTT GATTGCAGCA TTCCACGCGG CGGGCATTCG TGTGTTACTC GATGGCGTGT TTAACCATGT TGGCCGCGAT TTTTGGGCCT TCCGCGATGT ACAAAGCCAT GGGCAGGCTT CGAGCTATAG CGATTGGTTT GCGGGCTTGG ATTTCAATCA ACACAGCCCA TATGGCGATC AGTTTAGCTA TCAAGGTTGG CATGGGCACT ACGATTTAGT CAAATTGAAT TTACACAATC CGGCAGTGCG CGAACATCTT TTCCAAGCAG TCAGCCAATG GATTGCACAA TTTGGCATCG ATGGCTTACG GCTTGATGCT GCGGATCAGA TCGATCATGA TTTTTTGGCG GCGCTGGCAG CTCATTGCAA AAGCCTGCGC AGTGATTTTC TATTGATTGG CGAAGTGGTG CATGGCGATT ATCGCCAATG GGCCAACCCA ACGATGCTCG ATAGCGTGAC CAACTATGAA GCCTACAAAG GCCTGTATTC CAGCCTGAAT GATCGCAACT ATTTTGAAAT TGCTTACAGT CTCAATCGTC AATTTGGCGC TGGTGGTATC TATCGGGCTA TGCCGTTGTA TAACTTTGTT GATAATCATG ATGTCGATCG AATCGCCAGC ACCTTGCACA ATCCAGCCCA TCTCTACCCT TTACATCTAT TGCTTTACAC CATGCCAGGA ATGCCTGCGC TGTACTATGG CAGCGAATGG GGCTTGCTCG GCCAGAAAAC CGCCACCAGC GACCAAGCCT TGCGACCAGC CTTGCCTCAA CCCGACCAAA TTCAGGCCCA ACAACCAGAT TTACTGGCAG CGATTCGCCA ATTGAGCCAG CTACGCCACG AATATGCTGC CTTACGTTAT GGCGAATATG CCCAAATATA TGTACAGGCT GAACAGCTTG TGTTTGTGCG TTGGACTAAG CAGCAAACAA TTGTGGTCGC ATTGAATGCC GCCCCAACCG CCCAAACAAT TCGATTTGTT GTGCCATTTG GTGAAGGCTC ACTGCTTGAA GATCGCTTGA ATGGCGGGGA AATCAAGGTG CAGCATGGCC AATTACAGCT GACAATTCCC GCAACCTGGG GCCAAATCTG GCAGCTGATT GATTGA
|
Protein sequence | MHPWANDVQC YQIYPLGLCG APIRNDQTSE PIARLKQLHT WIEHLQHLGS NLLYLGPVFE STAHGYDTID YFTVDRRLGS NNDLQQLIAA FHAAGIRVLL DGVFNHVGRD FWAFRDVQSH GQASSYSDWF AGLDFNQHSP YGDQFSYQGW HGHYDLVKLN LHNPAVREHL FQAVSQWIAQ FGIDGLRLDA ADQIDHDFLA ALAAHCKSLR SDFLLIGEVV HGDYRQWANP TMLDSVTNYE AYKGLYSSLN DRNYFEIAYS LNRQFGAGGI YRAMPLYNFV DNHDVDRIAS TLHNPAHLYP LHLLLYTMPG MPALYYGSEW GLLGQKTATS DQALRPALPQ PDQIQAQQPD LLAAIRQLSQ LRHEYAALRY GEYAQIYVQA EQLVFVRWTK QQTIVVALNA APTAQTIRFV VPFGEGSLLE DRLNGGEIKV QHGQLQLTIP ATWGQIWQLI D
|
| |
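The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only and is not how the database produced its translation. It assumes the 10-character blocks are display formatting, so spaces are stripped first, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The helper `translate` is a hypothetical name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# for translation table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    seq = dna.replace(" ", "").upper()  # remove display spacing
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence shown above.
head = "ATGCATCCAT GGGCCAACGA CGTACAATGT"
print(translate(head))      # MHPWANDVQC

# Final six bases of the gene sequence: last residue plus stop codon.
print(translate("GATTGA"))  # D*
```

The first ten translated residues match the first block of the protein sequence, and the final two codons give the terminal D followed by a stop, consistent with the 451 aa protein length.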