Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3705 |
Symbol | |
ID | 5735569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4658032 |
End bp | 4659885 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280857 |
Product | alpha amylase catalytic region |
Protein accession | YP_001546469 |
Protein GI | 159900222 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00647332 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTACC CAACTTGGAC AAGCGCTGTG CACCACGATG GTTCGGCGCT GTACCTTCAA CCAAGCCAGC CCTATCACCT TGGTCAACAA GTAACTGTTC GCTTGCGAAC TCCATTAGCT GCGCCAATTA CCCAAGCCTT TATTCGCATC TGCCCCGATG GCGAGCAAAC GTTTGTGGCG ATGCAACCAG CCGAACGGAC TGAAACGATT CAATGGTGGC AAGGCCAAAT CACGCTCTCG ATGCCGCGCA CTGGCTATCG CTTTTGGCTG ATGACCGAGC AAGGCGGCTG GTGGCTTTCG GCAGCAGGCA TGCAACGTTC AACCCCTACC GATGCGACCG ATTTTAAGTT GTTGGCTGAT TATCATGCCC CAACCTGGGT GCATTCAGCA GTGTTCTATC AAATTTTTCC TGATCGGTTT TGTGATGGCG AGCCAAGCAA TAATGTGGTT GATGGCGAAT ATACAGTTTA TGGCAAGCCA ACGATTGCCC GCCAATGGGG CGAGGCTCCT CAGAAAGCCA CGGGTGGTAT CGAGTTTTTC GGTGGCGATT TACAGGGTAT TAGCCAAAAG CTTGATTATC TTGCGCAGCT AGGGATTAAT GCGCTGTATC TCACGCCGAT TTTTACTGCT CCCTCAAACC ACAAATACGA TACCGCCGAT TATCTGCAAA TCGATCAGCA TTTTGGCGGT GAGGCGGCTT TGGCTGAATT GCGCCAAGCA ACCCAACGCT ACCAGATGAA ATTGATGTTG GATATTGTGC TCAATCACTG TGGCTACACT CATCATTGGT TTACGGCGGC TCAAGCCGAT ACCAACGCGC CTACCGCTGA TTATTTTTCG TGGAAGCAAC ATCCCAACGA GTATGAATCA TGGTTAGGTC ATCGCTCATT GCCCAAACTC AATTACACCA GCCATGGCTT GCGCCAAGCA ATTTATGGCA GCGAACAGGC GATTGTGCGC CATTGGTTGC GTCAACCCTA TGCGATCGAT GGCTGGCGAA TCGATGTAGC CAATATGTTG GCCCGCCAAG GTTCGAGCCA GTTAGGGCAT AAAATTGGCC GCGCACTGCG CCGCGCCGTA AAAGCCGAAT CACCTGAAGC CTATTTACTT GGCGAACATT TCTATGATGG CACGAATCAC CTTCAAGGCG ATGAACTTGA TGCCAGCATG AACTATCGTG GCTTTACCTT CCCGACCTTG CAATGGCTAG TTGGCTTCGA TATGGCCTCG GTGTGGAACC TGGTTTGGGA AGATCGGGCC TTATTGCCGA CTGAAGCCTT GGGCGAGCAA TGGCTGGCCT TTTTGGCCGT GATTCCATGG CAAGTCGCTT TGCAACAATT CAATTTGCTC GATTCGCACG ATACGCCACG TTTGTTGACG ATTGTTGGTG GTGATCTGTC ATTACATCAC GTCGCAGTTA CCCTGCAAAT GACCTTCCCG GGCGTGCCCT GCATCTATTA TGGCGATGAA GTCGGCATGC AAGGCGGCGG CGATCCCGAG TGTCGCGGCT GTATGCCATG GGATGCACAA GTTTGGGATC ACGATCTGCT AGCTTTTTAT CGTTCGCTGA TTGGATTACG CCGTAGTTCG AGCGCTTTGA GTGTTGGCGG ATTTCAATTA TTGCTGGCCG AAGGCGATAC GGTGGCCTTT ATGCGGCGCA GCGCTGATGA ATGTTTGTTG ATCGTTGCCC AGCGGGCTGC TACCAGCATT CCCCCAATTC CAATGTTCGC AACCGGATTG ACTGATGGTA CAAGCTTTAT CAAAGTCGCT GGCACAACGA AAATTACGAT TCAGGCTGGG GTACTAGTTT TGCCACAAAC TGGCATTAGC GCCAGCATCT GGCAAATGCA GTAG
|
Protein sequence | MHYPTWTSAV HHDGSALYLQ PSQPYHLGQQ VTVRLRTPLA APITQAFIRI CPDGEQTFVA MQPAERTETI QWWQGQITLS MPRTGYRFWL MTEQGGWWLS AAGMQRSTPT DATDFKLLAD YHAPTWVHSA VFYQIFPDRF CDGEPSNNVV DGEYTVYGKP TIARQWGEAP QKATGGIEFF GGDLQGISQK LDYLAQLGIN ALYLTPIFTA PSNHKYDTAD YLQIDQHFGG EAALAELRQA TQRYQMKLML DIVLNHCGYT HHWFTAAQAD TNAPTADYFS WKQHPNEYES WLGHRSLPKL NYTSHGLRQA IYGSEQAIVR HWLRQPYAID GWRIDVANML ARQGSSQLGH KIGRALRRAV KAESPEAYLL GEHFYDGTNH LQGDELDASM NYRGFTFPTL QWLVGFDMAS VWNLVWEDRA LLPTEALGEQ WLAFLAVIPW QVALQQFNLL DSHDTPRLLT IVGGDLSLHH VAVTLQMTFP GVPCIYYGDE VGMQGGGDPE CRGCMPWDAQ VWDHDLLAFY RSLIGLRRSS SALSVGGFQL LLAEGDTVAF MRRSADECLL IVAQRAATSI PPIPMFATGL TDGTSFIKVA GTTKITIQAG VLVLPQTGIS ASIWQMQ
|
| |