Gene Information |
Locus tag | Haur_4336 |
Symbol | |
ID | 5736196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5542255 |
End bp | 5544999 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281497 |
Product | alpha amylase catalytic region |
Protein accession | YP_001547096 |
Protein GI | 159900849 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
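The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 5542255
end_bp = 5544999
gene_length_bp = 2745
protein_length_aa = 914

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the final codon is the stop codon and encodes no amino acid.
codons, remainder = divmod(computed_gene_length, 3)
computed_protein_length = codons - 1

print(computed_gene_length == gene_length_bp)        # True
print(remainder)                                     # 0
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the reported gene length (2745 bp) and protein length (914 aa) are consistent with the reported coordinates.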
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGAAACA AACGATTTGG CGGATTGCTC GCGTTGGTAC TGTTGGCCTT GGCCTTGTTG CCAAACCCGA AGTCGGTTGC CGCTGGCGAT AATAATGTGC ACTGGGATGA GCTGTATCAT GCTGCTCCCA GCGCCAACCC TCGCACTGAG CTTGTGCCAG GCGAAAGTTT CAGCTTTCAA CAAGCCGAAG TCAATGGCAC AATTGTTTCA ACCACCAATG TCCAAATTTC AATCTTAGCG CTGGCTGGCG ACCTAACCAG CGCCCAAATT CGCTACTGGA ACGGAACCGA GCAGATCGTA GCGATGAGCA AAGTCAAAAC GCTGACTGCC AGTTTCCGCA ACACCGCTAG CACCAGCTAC GATCTGTGGC GTGGCACAAT TCCAGCGCAT GCTGCTGGCT CAACCGTGTA TTATCGGGTT CGGGTGTATG ATGGCAGCGT TTTAGCCTTA CTCAAAGCCC AAAATGGCAG CTACACCAAT CCACGCGGCC AGCATGTACG TGGCGCAAAC TATGATCCCG ATGATTATAG TTTTACGGTG CAAAGCGGCG GTATGCCCAC AGCTACTCCA ACAATTGCCC CAACCGCCAC CCGCACCGCC ACTCCCAGCG CTACAGCCAC GCGCACGCCA ACCCCGGTTG GCACTATCAG CGCTACGCCC ACGCCATCGC GCACCCCAAC TGCGACTGCC ACCAACACCT CAACTCCCAC AGTTTCAGCA ACACCATCGG GGGCTTGCAG CGGCGCAGCG GTTGGCAACA ATACAATTAT CAGCAGTGCG GTGTACCACG ATAGCACCAA CAGCGTGTAT CGCGACCCGC TTGGTTCGCT GCAAGCAGGC CAATCGGCCA GCATTCGTTT GCGCACTTGT AGTAATGATG TTAGCGCTGT GAGTTTATCG GTTTGGTTAA CTGGCGCACC ATTCAGCCAA CCATCGTTCA GCTATCCATT AACTGTGGTA AGCAATGATG GAACCTACGC CATGTGGCAA GCCAGCGTAC CAGCCCCCAG CAGCTCAACC GATCAGTGGT ATCAATTTAA GCTGACCGAT GGCTCGACAA TTGGCTATTA TGTGGTTGCC AACACCAGCA ACACAGGTCC AGGCGTGTGG AGCGCCACCG CGCTTGATCG TTCGTGGAAG CTGGGTACAG TTCCCGCGCC ACCGCAAGAT TATGCCGTAC CAACATGGCT GCAAGATGCA GTGATCTATC AAATTTTCCC TGATCGCTTC CGTGATGGCG ATAGCAGCAA CAATTTCAAT AATGTGCGGG TCTACGGCCC AAACACCTGC AACGGCTACA GCGGTGCAGG TGCACCCAAC TGTTTGGCCT CGATCCACAG CAATTGGAAT GAAACGCCAA CCACCCCAGG CTATGGCATC GATTTTTATG GTGGCGATTT GCAAGGCATT GTCGATAAAA TCAATGCTGG TTATTTTAAC GACCTTGGCG TTAATGTGCT GTATCTCAAC CCGATTTTTG ATGCTTCATC GAACCATGGC TACGATACCA ACGATTATTA TGGTATCAAC CCACGCTTTG GCAATTTGGC AAAATTCGAC GAGATGATTG CCGCCGCCGA TGCCAAAGGC CTCAAAGTGA TTCTTGATGG TGTGTTCAAC CACGCTGGCA TGGATAGCAT TTATCTTCAA GGTTATCCAG GCTATAAAAC CGACCGCTGG ACAGGCATCA ACGGCGCTTG TGAATCGGAT TCATCGCCCT ACCGCAGTTG GTTCACCCAA GGCTCAGCTG GCACCAGCGG CTCGTACCCA TGTGTTGGCG GTTGGGGCTG GAAGGGTTGG 
TATGGCTATG AAACCATCCC TGAATTTATC GAAAACGACC CAGTGAAGCA ATTTTTCTAT CGCGATGGCA GCGCTCAAAG CCCCAATGGC AAATCAGTAA CCCGCTTCTG GCTCGAACGG GGTATCGCTG GCTGGCGCTT CGATGTAGCC CAAGATATCA CCCACGCTTG GTGGAGCGAT ATGCGGCCTT ATGTTAAAAA TGGTTATGGC GATAGCGAAA GTTTATTGCT GGGCGAAGTT ACGGGCGGCT GTGATTGGGG CTTATATCGT GCCTATCTCA ACCAAAACGA GCTTGATTCG GTGATGAACT ATTGTTTCCG TGATTGGGCA GTGAGCTTCG CCAATGGCAA TGCACCTAGC TCATTCGACA GTAGCTACAA TGCCTTCCGT GCGCAAATGC CTGCTAGCCC ATGGTTTGGC ATGATGAACT TAGTCAGTTC GCACGACTCA ACTCGCGCCT TGCGCTTGCT CAACGACGAC AAAGCCCGCA TGAAATTGAT GGTGTTGTTG CAAATGACCC TACCAGGTGC GCCATCGGTG TATTATGGCG ACGAAGTTGG GGTAACTGGT GGCGGCGATC CCGACAACCG CCGCACCTAT CCTTGGGCCG ATAAAGGTGG TAGCCCCGAT ACGGTGATGT ATGCTCATTT CAAGAAATTG ATCGCGCTAC GCCGTACCTA TCCAGCGCTC AGCAGTGGTG ATGTTGCAAC CTTGTTGGTC AACGATGCCA GCAAACTTTA TGGCTATCGC CGCTGGAAAG GCACGCAAGA GGCAGTTGTA GTGCTCAATA ACGGCACGGC CAACCAAACT GCGACAGTTA ATGTGAGCCA TTTAGCCAAT GGCACAGTCT TGACCGATGT CTTGAATGGT GGCAGCTACA CCGTTAGCAA CGGCCAATTG ACCTTGCCAG TCGCAGCCCA ATCGGGCGTG GTGCTGGTGA AGTAA
Protein sequence | MRNKRFGGLL ALVLLALALL PNPKSVAAGD NNVHWDELYH AAPSANPRTE LVPGESFSFQ QAEVNGTIVS TTNVQISILA LAGDLTSAQI RYWNGTEQIV AMSKVKTLTA SFRNTASTSY DLWRGTIPAH AAGSTVYYRV RVYDGSVLAL LKAQNGSYTN PRGQHVRGAN YDPDDYSFTV QSGGMPTATP TIAPTATRTA TPSATATRTP TPVGTISATP TPSRTPTATA TNTSTPTVSA TPSGACSGAA VGNNTIISSA VYHDSTNSVY RDPLGSLQAG QSASIRLRTC SNDVSAVSLS VWLTGAPFSQ PSFSYPLTVV SNDGTYAMWQ ASVPAPSSST DQWYQFKLTD GSTIGYYVVA NTSNTGPGVW SATALDRSWK LGTVPAPPQD YAVPTWLQDA VIYQIFPDRF RDGDSSNNFN NVRVYGPNTC NGYSGAGAPN CLASIHSNWN ETPTTPGYGI DFYGGDLQGI VDKINAGYFN DLGVNVLYLN PIFDASSNHG YDTNDYYGIN PRFGNLAKFD EMIAAADAKG LKVILDGVFN HAGMDSIYLQ GYPGYKTDRW TGINGACESD SSPYRSWFTQ GSAGTSGSYP CVGGWGWKGW YGYETIPEFI ENDPVKQFFY RDGSAQSPNG KSVTRFWLER GIAGWRFDVA QDITHAWWSD MRPYVKNGYG DSESLLLGEV TGGCDWGLYR AYLNQNELDS VMNYCFRDWA VSFANGNAPS SFDSSYNAFR AQMPASPWFG MMNLVSSHDS TRALRLLNDD KARMKLMVLL QMTLPGAPSV YYGDEVGVTG GGDPDNRRTY PWADKGGSPD TVMYAHFKKL IALRRTYPAL SSGDVATLLV NDASKLYGYR RWKGTQEAVV VLNNGTANQT ATVNVSHLAN GTVLTDVLNG GSYTVSNGQL TLPVAAQSGV VLVK
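The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares for all sense and stop codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that does not matter here). The function name `translate` and the variables `gene_head` and `gene_tail` are hypothetical; only the first 30 and last 15 bases of the gene sequence are used.

```python
from itertools import product

# Codon table in the NCBI ordering: bases T, C, A, G with the third position
# varying fastest. The sense/stop assignments are the same for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Spaces (as used to group the sequence in blocks of 10) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence above (both in frame,
# because 2745 - 15 = 2730 is a multiple of 3).
gene_head = "ATGCGAAACA AACGATTTGG CGGATTGCTC"
gene_tail = "GTGCTGGTGA AGTAA"

print(translate(gene_head))  # MRNKRFGGLL
print(translate(gene_tail))  # VLVK
```

The two outputs match the first ten and last four residues of the protein sequence listed above. For whole-genome work an established library would normally be preferable to a hand-built table.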