Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2832 |
Symbol | |
ID | 5734713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3599922 |
End bp | 3601127 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279975 |
Product | Alpha,alpha-trehalase |
Protein accession | YP_001545598 |
Protein GI | 159899351 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.613992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACAGC CTGCAATCAC GCAGTATATC GATGCTTATT GGGACAAACT TGTTCGTCAA CAGCCGGAAG ATCAAAAAAC CTTAATTGGC TTGCCCCATC AATTTATGGT GCCAACCCAC GACCCTACCT TTCAAGAGAT GTTTTATTGG GATAGTTTTT TCATTGCGCT GGGGTTGGGT GGCACGCGCT ACGAAGCGGT AATCGAGGGC ATGGCCGAAA ACATGGCCTA TCTCTACCAG CGCTTCGGGG TGATTCCTAA CGCTAGCCGT TATTATTTTC TTTCGCGCAG CCAACCACCT TTCTGGACTC AATTGATTTG GCTGGCCTAC CAAACCAAAC AGGCCGCTGG CGATCCTGAT AGCGCTGCTT GGTTACAACG CCTGATGGCA CTCGCTGAGC AGGAGCATGC CAGTGTTTGG CTGGCTACCA CCCATCCGCA TCAGCGCCAA GTGCATCGCG GGTTGTCGCG CTATTTCGAT ATTAACTATT TGGATACCCT GGCCTGTTGC GAAAGTGGCT GGGATCATTC AACCCGCTGC AATGGCCAAT GGATGAGCCA TTTGCCAGTT GATTTGAATA GCATTTTGTA TCTGCGTGAG TGCGATTTTG CCCAAGCCGC CCGTGTGCGC GACGATCACG CGGCTGCCGA GCAATGGCAA TCCTGCGCCG ATCAACGCGC CGAAACCATG CAAGCAGTTT TTTGGGATGC CGCTAGTGGC TTTTTCTATG ATTACAATTA TCTCAATGAA GTGGCTGACC TAGATAACCC TTCGTTGGCC GGATTTTACC CCTTGTGGGC TGGTTGGGCG ACCGAAGTTC AGGCGGCGCA GGTGGTCGAG CAATGGTTGC CAAGCTTTAT GCGAGTTGGT GGTTTGGTGA CAACGCTCAA AACCCATGCT AGCTATCAAT GGGCTAGCCC CAACGGCTGG GCACCATTGC AATGGATTGT CGTCGAGGGT TTGTTACGCT ACGGCTATCA ATCCCAAGCG CGTGAGGTCA TGCAAGCATG GTGTACGCTC AACGAAACTG TCTTCGAGCG AACCAACGCC ATGTGGGAAA AATATAACGT GGTTGACCCA ACGGGCGAAG TTGAGGGCGG CAAATATGGC TCGTTGCCAG GCTTTGGCTG GTCGAATGCG GTTTATCTCG ATTTCAAGCG CCGCTTAGCC CAACCAACTA TCGAACGCTG GAAGCTTGGC GAATAA
|
Protein sequence | MRQPAITQYI DAYWDKLVRQ QPEDQKTLIG LPHQFMVPTH DPTFQEMFYW DSFFIALGLG GTRYEAVIEG MAENMAYLYQ RFGVIPNASR YYFLSRSQPP FWTQLIWLAY QTKQAAGDPD SAAWLQRLMA LAEQEHASVW LATTHPHQRQ VHRGLSRYFD INYLDTLACC ESGWDHSTRC NGQWMSHLPV DLNSILYLRE CDFAQAARVR DDHAAAEQWQ SCADQRAETM QAVFWDAASG FFYDYNYLNE VADLDNPSLA GFYPLWAGWA TEVQAAQVVE QWLPSFMRVG GLVTTLKTHA SYQWASPNGW APLQWIVVEG LLRYGYQSQA REVMQAWCTL NETVFERTNA MWEKYNVVDP TGEVEGGKYG SLPGFGWSNA VYLDFKRRLA QPTIERWKLG E
|
| |