Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4334 |
Symbol | |
ID | 5736194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5534064 |
End bp | 5535914 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281495 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001547094 |
Protein GI | 159900847 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.638626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATACA CCCTCGAATC AACAGGCTTA TTGGTATTTC AACGTGATGA CGAAGAAGTT TTGACGATTC AGCCCTACTT GCAGGCTTAT GGCTCGCAGT TTGCCGCATT AGGTGTACAA ACCAAGCCTG ATACGGTGCA ATACACTAGT TTCGATAACC AGACCTTTGA GCTTAATCTT GAAACGACGG CAACCGGCTA TCAGTTGGTG TTTCACGTGA AACATCAGGT TGCTCAAATT GGTTTAGCAA TCGGCGTACA ACCAGCAGGC TCGTGGTATG GTATGGGCGA ACGGGTGATT CAAAGTTGGC CGCTCAACTT GGCTGGTGTG CAAAGCCAAC CATTTATGAC CTATGATCAC GCCAATGATG GCACACTCAA TATTGTTACG CCAGCCTGGA TTGGCGCGAA TGGCGTGGCT TTTATCGTGG CCGAAGATAC AGGCCCGTTG CATGTCACAA TCGATAGCGA TCCAGCAGGT GTGATTCGGC TGGTGCAGTT TCCTTCGCCA ACCCCATTTG GCGCGGGCTT GGATGGCAGC GAAACCCATT ATGAAGGCAC GCGCTTGGTG CTCGATCTAC TGATTGCTGA AAATGTCAGC GTTGCCGCTC AACACGTCAT TCAACAATTG GGCTACCCCA AGGCGGCTCC ACCCTTGGCA ATGTTTAGCA AGCCAATTTG GACAACCTGG GCACGCTATA AAATGGATAT CGATCAAGCT CAAACGCTGG CGTTTGCCCA AGAAATTATC GACCAACAGT ATCCATATTC GGTGCTAGAG ATCGACGATC GCTGGCAAAC TGCTTATGGC GATTTAGAAT TCGATCGGCG CAAGTTTCCC GATCCCAAGG CTATGGTCGA TCAATTGCAT CAGCTTGGCT ATAAAGTAAC CTTGTGGATT CCACCATTTT TCGATCCAAA GAGCGCGGCT TTTGCTGAAG CTACTGCTAA TGGCTATTTG GTCAAACATC CCGCCAACGA TCAACCGTAT TTGACCCGTT GGTGGCAAGG TTGGGGCGGT TTGTTGGATG TCTCGAATCC TGCTGCCTTG GCTTGGTGGC AAGCAGGTTT GGAGCGCTTG CAAACCTTGT ATGGCATCGA TGGTTTTAAA TTTGACGGCG CTGAAGGGAA TTTTCTACCC GCCGAAGCCA AAACCCATCT GCCGATGACT CCCAACCAAT ATAGTGATCG CTATGTGGCT TTTGTGGCCA AATCGTGGCA ATGGACAGAG GTACGAACTG GTTGGCGCTC GCAACAGCAA CCAATCTTCT TCCGCGAATG GGACAAATGG AGTCGTTGGG GCATGGACAA TGGCTTGCAT GCGGTCGTTA CCCAAGCGCT TGCGATGAGC GTGATCGGTT ATCCCTATGT GCTGCCCGAT ATGATCGGCG GCAACGCCTA TAACGGTGAA TTTCCCGAGC GCGAGTTGCT GATTCGTTGG ACGCAAGTCA CGGCATTATT GCCAGCGATG CAATTTTCAA TCGCGCCATG GCAGTACGAT GTAGAAACCA GCCAGATTTG CCAGCGCTAT GCTCAATTGC ACGCTGAGCT AGAGCCATAC ATTGCCGAAT TGGTGCAAGC GACCATCACC GATGGTACGC CTTTAGTTCG GCCTTTGTGG TGGCACTATC CCGACGATGC CAGCACGCGC TTTATTGGTG ATCAATGGTT GTTTGGCGAG CAATACTTGG TTGCGCCAAT GCTCCAAGCC AACCACTACC AACGTGACAT TTATTTGCCT GAAGGTGGCT GGCGCGATTA TTGGACTGGC GAGAAATTCG AGGGTGAAAC CTGGCTCTAC AATTATCCTG CGCCCTTAGA AACCCTGCCG TTGTTCGAGC GGCTGTGGTA G
|
Protein sequence | MAYTLESTGL LVFQRDDEEV LTIQPYLQAY GSQFAALGVQ TKPDTVQYTS FDNQTFELNL ETTATGYQLV FHVKHQVAQI GLAIGVQPAG SWYGMGERVI QSWPLNLAGV QSQPFMTYDH ANDGTLNIVT PAWIGANGVA FIVAEDTGPL HVTIDSDPAG VIRLVQFPSP TPFGAGLDGS ETHYEGTRLV LDLLIAENVS VAAQHVIQQL GYPKAAPPLA MFSKPIWTTW ARYKMDIDQA QTLAFAQEII DQQYPYSVLE IDDRWQTAYG DLEFDRRKFP DPKAMVDQLH QLGYKVTLWI PPFFDPKSAA FAEATANGYL VKHPANDQPY LTRWWQGWGG LLDVSNPAAL AWWQAGLERL QTLYGIDGFK FDGAEGNFLP AEAKTHLPMT PNQYSDRYVA FVAKSWQWTE VRTGWRSQQQ PIFFREWDKW SRWGMDNGLH AVVTQALAMS VIGYPYVLPD MIGGNAYNGE FPERELLIRW TQVTALLPAM QFSIAPWQYD VETSQICQRY AQLHAELEPY IAELVQATIT DGTPLVRPLW WHYPDDASTR FIGDQWLFGE QYLVAPMLQA NHYQRDIYLP EGGWRDYWTG EKFEGETWLY NYPAPLETLP LFERLW
|
| |