Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2969 |
Symbol | |
ID | 5734841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3745721 |
End bp | 3747295 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280113 |
Product | alpha amylase catalytic region |
Protein accession | YP_001545735 |
Protein GI | 159899488 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TTTTGAGTGG CCTGCTTTTA GCCGGGGTGA TTGTGGGCTG TGGCGGCCAA GCAACTCCAA CCGCTGTTCC GGCAACAGTG ACCAGCCAAT CAACCCCAAC CAGCCAAGCC TTGAATCCAA CCGCTACTGC CGCTACTGTG GTTGATAACT CAATTACGCC AACGCCATTG CCAACCAAAC CAACAGCCCC AATCTTTACC GCTGACGATG AGCGTTGGGC TGGCCGTTCG ATCTACTTTA TTATGATCGA TCGCTTTGCC AATGGCGACC CAAGCAACGA CAACGCCGAT GGCTTTGGGG CAGATCGTAG CGATCCACGG CGTTGGCATG GCGGCGATTT TCGCGGCATT ATCGAGCGGC TCGATTACAT CAAAGGCATG GGCTTTGGCG GCATTTGGAT CACGCCAGTC AGTAAGCAAA ATTCAACCAA TGCCTACCAT GGCTACTGGC AATACGACCC CTACCAAATT GACCCGCATT TTGGCACGCT GGAAGAATTG CGCGAATTGG TCAGCGAAGC CCACAAACGC GATATATTGG TGATGCTCGA TGTTGTGCCC AATCATATGG GCGATTTCTT GCCTGGCTCG AAAGCTGCCC CGCCATTCGA TGACCCAACC TGGTATCACA ACAAGGGCAA CATTCAAAAT TATGGCAATC AACAAGAGGT TGAAGATGGC GATTTGCTCG GGCTTGATGA TTTAGATCAG GATAATCCTG CTACCCGTGC TGAATTACTC AAATGGATTG CTTGGCTTAA AACCGAAACT GGGCTTGATG GCTTGCGAGT TGATACGGCC AAACATTTGC CCAAAGATTT TCTCCGTGAG TTTGATCAAG CGGCCAATAC GTTTTCGCTG GCTGAGGTAT TTAGCAGCGA TGCGGGCTAT GTTGCGCCCT ACACCGAATT TAACGACGCA ATTTTGGATT ACCCCTTGCA CAGCGCCTTT AAAGAAAGTT TAGTCGGTGG TCGCACGTTG TTGGTGATTC AGCGCGTGCT CGAAAATGCC GATCAACAGT ATCGCAATGT CCATGTCAAC GGCACATTTC TCGATAATCA CGATAACGAG CGCTTTTTAT GCTTGGCAAC TGGCGGCCCC AACGCCGATA AAACCACTCA ATTGCGGCAA GCTTTGGCGG TGCTCTATAG TTTGCGTGGC ATTCCGATTG TCTATTATGG CACCGAGCAA GAACTCAACG GCTGCAAAGA TCCCTTCAAC CGCGAAGATG CCTTTGAATT GAATGCGACT GATGTACCAG TCTATCAATG GATCAGCCAA CTCAACCAGA TTCGCCAAGC CCATCCAGCC TTGCAACGTG GCACACTCGA AAGCCGCACA ACTCCTAGCG ATGCATGGGC CTTTCAACGC ACGGCGGGCA ACGATACGGT CGTAGTTTGC ATCAATAACA CATGGAAATC GCTCGACTTG GCAGTAACTG GTTTGACTGA AATTGCTGAT GGTGAGGTGT TGACTGATGC GCTTGGTAGC GGTCAAATGA GCGTTAAAAA TGGCGAAATG AATTGTGCTC TACAACCAAA ACAGGTGCTG ATCTATACCC GTTAA
|
Protein sequence | MKKLLSGLLL AGVIVGCGGQ ATPTAVPATV TSQSTPTSQA LNPTATAATV VDNSITPTPL PTKPTAPIFT ADDERWAGRS IYFIMIDRFA NGDPSNDNAD GFGADRSDPR RWHGGDFRGI IERLDYIKGM GFGGIWITPV SKQNSTNAYH GYWQYDPYQI DPHFGTLEEL RELVSEAHKR DILVMLDVVP NHMGDFLPGS KAAPPFDDPT WYHNKGNIQN YGNQQEVEDG DLLGLDDLDQ DNPATRAELL KWIAWLKTET GLDGLRVDTA KHLPKDFLRE FDQAANTFSL AEVFSSDAGY VAPYTEFNDA ILDYPLHSAF KESLVGGRTL LVIQRVLENA DQQYRNVHVN GTFLDNHDNE RFLCLATGGP NADKTTQLRQ ALAVLYSLRG IPIVYYGTEQ ELNGCKDPFN REDAFELNAT DVPVYQWISQ LNQIRQAHPA LQRGTLESRT TPSDAWAFQR TAGNDTVVVC INNTWKSLDL AVTGLTEIAD GEVLTDALGS GQMSVKNGEM NCALQPKQVL IYTR
|
| |