Gene Information |
Locus tag | Haur_3073 |
Symbol | |
ID | 5734945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3880872 |
End bp | 3882305 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641280217 |
Product | alpha amylase catalytic region |
Protein accession | YP_001545839 |
Protein GI | 159899592 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
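The coordinate and length fields above are consistent with one another, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3880872, 3882305)
print(length, protein_length_aa(length))  # prints: 1434 477
```

Under these assumptions the result matches the reported Gene Length (1434 bp) and Protein Length (477 aa).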
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCA GCGATTGGCA AACGCCCGAT TGGGTCAAAC ATGCCGTCTT TTATCAGATT TTCCCTGAGC GCTTTGCCAA TGGTGATCGG ACAAATGATC CGGCAAATGC GCAACCTTGG GGTACAAGCC CAACCTTGTA TAATTATATG GGCGGCGATC TACAAGGAAT TATCGATAAG CTTGATTATT TAGTGGATTT GGGCATTAAT GCGCTGTATC TCAACCCAAT TTTTCAAGCC ACCACCTCAC ATAAATATAA TACCTTCGAT TATTTTAAAA TCGATCCCCA TTTTGGTACG CTAGAAACGT TTAAAACCTT ATTGAATGAA GCGCATCGAC GTGGCATTAA AGTTATTCTC GATGCGGTGT TTAATCATTG CGGTCGCGGC TTTTTTGCCT TTCACGATGT AATTGAAAAT GGTGTGCACT CGCCCTACAC CAATTGGTTT CATATCTCAC GCTTTCCAAT TCATCCCTAT GAATCGCGCT ATGCCGCTAA TTATCGCACG TGGTGGGATT TTCGCGAGTT GCCCAAATTC AACACCGATA ATCCGGCGGT ACGCAAATAT TTGCTTGATG TAGCTCGCTA TTGGATTGAA TTGGGTATTG ATGGTTGGCG CTTGGATGTG CCAAATGAAA TTGATGATCA TAATTTTTGG CGTGAGTTTC GCACAATTGT CAAAGATATC AATCCTGAAG CCTATATTGT GGGCGAAATT TGGACTGACG GCTCAGCTTG GCTGCAAGGC GATCAATTTG ATGCCGTGAT GAATTATCTA TTTCGCGATT TATGTACCGA TTTCTTTGCT AGCTATCGGG TACGTGCCGC TGATTTTGCG GCTGGAATTG ACCATTTAAT TGTGCGTTAT CAGCCCCAAG TGACCTATGT CCAATTTAAT TTGCTTGGTT CACACGATAC TGCGCGGTTT TTGAGTGTGG CTGAAGAAGC TGGTAAATGG GCTTTAGAGC GCATGAAATT GGCGGTTTTG TTCAAATTAA TCTTTCCTGG TGCGCCATGT ATCTATTATG GCGATGAAAT TGGCTTGCAT GGCGGCAAAG ATCCTGATTG TCGGCGTTGT TTCCCGTGGG ATCAACCGCA AACCTGGCAG ACCGATCTCC AAGCTTGGAC CAAACGCTGG GTTAAGTTTC GCCATGAGCA TACAGCCTTG CGCACGGGCC ATTATGCGAC GCTGTTTGCC GACAACGATA TGAATATTTT TGCTTGTGCC CGTTGGGATG AGCAAAGCCA ATTTGTGATT GTGCTGAATA ATAACGAAAC ACCTTGGACA CTCGATTTGC CGTTGCATGC CCAATTACCA AGCGTCACTC ATTATCGCGA TGTGCAAACT GGCGAGTTGT ATAGCGTGGC CGAGGGTAAA ATTCGCGAGG TAGCATTGGC TCCGTGGAAG CATTTGGTAT TACAAGCTGA ATAG
|
Protein sequence | MTTSDWQTPD WVKHAVFYQI FPERFANGDR TNDPANAQPW GTSPTLYNYM GGDLQGIIDK LDYLVDLGIN ALYLNPIFQA TTSHKYNTFD YFKIDPHFGT LETFKTLLNE AHRRGIKVIL DAVFNHCGRG FFAFHDVIEN GVHSPYTNWF HISRFPIHPY ESRYAANYRT WWDFRELPKF NTDNPAVRKY LLDVARYWIE LGIDGWRLDV PNEIDDHNFW REFRTIVKDI NPEAYIVGEI WTDGSAWLQG DQFDAVMNYL FRDLCTDFFA SYRVRAADFA AGIDHLIVRY QPQVTYVQFN LLGSHDTARF LSVAEEAGKW ALERMKLAVL FKLIFPGAPC IYYGDEIGLH GGKDPDCRRC FPWDQPQTWQ TDLQAWTKRW VKFRHEHTAL RTGHYATLFA DNDMNIFACA RWDEQSQFVI VLNNNETPWT LDLPLHAQLP SVTHYRDVQT GELYSVAEGK IREVALAPWK HLVLQAE
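The sequences above are displayed in space-separated blocks of ten letters. A minimal sketch of how such a display string might be normalized and its GC content computed is shown here. The helper names are hypothetical, and the demo uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used to display a sequence in 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
demo = clean_sequence("ATGACAACCA GCGATTGGCA")
print(len(demo), gc_percent(demo))  # prints: 20 50.0
```

Applied to the full gene sequence, the cleaned length would be expected to equal the reported Gene Length, and the GC percentage can be compared against the GC content field.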