Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4065 |
Symbol | |
ID | 5735923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5190483 |
End bp | 5192273 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281216 |
Product | Alpha-amylase |
Protein accession | YP_001546825 |
Protein GI | 159900578 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.540437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACGGA CCACTCGCGC AATGGCGCGG CTGACACTGT TCGTCTTGGT TGTGATCATC TTCTCGGTTG GCTTTTTTGG AACCGCTCCA CGGGCAACGC AAGCCCAAAC TACTCCACGC ACGGCCTTCG TCCATTTATT CGAATGGAAA TGGACTGACA TCGCCAAAGA ATGCGAAAAT TGGCTCGGCC CCAAGGGCTT CGCTGCTGTT CAAGTCTCGC CACCCCAAGA GCATATCCAA GGCTCGCAAT GGTGGACGCG CTATCAACCA GTTAGCTATC AGATTCAAAG CCGCTCGGGC ACTCGCGCCG AGTTCGCCAA CATGGTTTCA CGCTGTAAAG CTGTTGGGGT TGATATTTAT GTCGATGCCG TGATCAACCA CATGACCGGC GTGGGCAGTG GCACGGGCGT AGCTGGCTCA AGCTACACCA GCTACAATTA CCCCGGTAAT TATCAAACTC AAGATTTCCA CCACTGTGGC CGCAATGGCA ACGACGATAT CAGCAACTAC CAAGATCGCT GGGAAGTTCA AAATTGTGAG TTGGTTAACC TCGCCGATCT CAAAACTGAA TCAGATTATG TTCGCGGCAA ATTAGCTGCC TATTTGAATG ATCTGCGCAG TTTGGGCGTA GCTGGCTTCC GCATTGATGC TGCCAAGCAT ATGCCCGCCG CTGATATTGC CAACATCATG AGCCGCGCCA GCAATCCTTA CATCTATCAA GAAGTGATTG ACCAAGGCGG CGAGCCAATT ACCTCAGGCG AATATACGGG CAACGGCGAT GTGACTGAGT TCAAATACAG CACCAACATT GGCCGCATGT TCAAAACCGA CAAGCTTGCC AACATGAGCA ACTTCGGCAC AGCTTGGGGC TTTATCGCCA GCGATAGTGC GGTGGTTTTC ACCGATAACC ACGACAACCA ACGCGGCCAT GGCGGCGCTG GCAATGTCGT TACCTTCAAA GATGGCAAAC TCTACGAACT TGCCAACGTC TTCGCTCTAG CTTGGCCCTA TGGCTATCCC CAAGTCATGT CGAGCTACAA CTTCAGCAAC GGCGACCAAG GCCCACCCAG CAGCAATGTC TACAATGGCA ACACCGCCGA TTGCGGTGGC AGCAACTGGG TTTGTGAACA TCGCTGGCGC GGCATCGCCA ATATGGTTGG CTTCCGCAAC TACACTAGCA CAGCCTTCAG CACCAGCAAC TGGTGGTCGA ATGGCAATAA TCAAATTTCG TTCAGCCGTG GCAGCTTGGG CTTCGTAGCA ATCAACCGCG AAGGCAGCAG CTTGAGCCGC ACCTTTGCTA CGGGCTTGCC CGCCGGAACC TACTGCGATG TAATTCACGG CGATTTCAAC AATGGCTCGT GCTCTGGCCC AACCATCAGC GTCAACAGCA GTGGCCAAGC AACAATCACG GTCGCCGCAA TGGATTCAGT GGCAATTCAT GGTGGCGCAA AAATCAATGG CACTAACCCA ACGCCAGTGC CAACCACCCC ACCAAGCGGC AGCATCGCTG TCACCTTCAA CGAAAATGCC ACCACGGTTT GGGGCCAAAA TGTCTATGTG ATTGGCAATG TCTCGGCACT TGGTAGCTGG AACACCGCCA ATGCTGTGTT GCTCTCATCA GCAAGCTACC CAGTTTGGAG CAAGACAATC AACTTGCCAG CCAGCACCGC CATCGAATAC AAATACATCA AGAAAGATGG TTCGGGCAAT GTGACCTGGG AAAGCGGTAG TAACCGTACA TTTACCACGC CAAGCAGCGG CACGGTCACC CGCAACGATA CCTGGAAATA G
|
Protein sequence | MSRTTRAMAR LTLFVLVVII FSVGFFGTAP RATQAQTTPR TAFVHLFEWK WTDIAKECEN WLGPKGFAAV QVSPPQEHIQ GSQWWTRYQP VSYQIQSRSG TRAEFANMVS RCKAVGVDIY VDAVINHMTG VGSGTGVAGS SYTSYNYPGN YQTQDFHHCG RNGNDDISNY QDRWEVQNCE LVNLADLKTE SDYVRGKLAA YLNDLRSLGV AGFRIDAAKH MPAADIANIM SRASNPYIYQ EVIDQGGEPI TSGEYTGNGD VTEFKYSTNI GRMFKTDKLA NMSNFGTAWG FIASDSAVVF TDNHDNQRGH GGAGNVVTFK DGKLYELANV FALAWPYGYP QVMSSYNFSN GDQGPPSSNV YNGNTADCGG SNWVCEHRWR GIANMVGFRN YTSTAFSTSN WWSNGNNQIS FSRGSLGFVA INREGSSLSR TFATGLPAGT YCDVIHGDFN NGSCSGPTIS VNSSGQATIT VAAMDSVAIH GGAKINGTNP TPVPTTPPSG SIAVTFNENA TTVWGQNVYV IGNVSALGSW NTANAVLLSS ASYPVWSKTI NLPASTAIEY KYIKKDGSGN VTWESGSNRT FTTPSSGTVT RNDTWK
|
| |