Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1148 |
Symbol | |
ID | 5733041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1316404 |
End bp | 1318542 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278288 |
Product | alpha-glucan phosphorylase |
Protein accession | YP_001543924 |
Protein GI | 159897677 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0058] Glucan phosphorylase |
TIGRFAM ID | [TIGR02094] alpha-glucan phosphorylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.191282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTTG CTGAATTACC ACCGCTGTTT GCCCCCCTAC CCCAACGGAT CAACCGACTC TATGAGTTGG CTTATAACCT TTGGTGGAGC TGGCAACCAG AAGCCCAAGC GCTTTACCAA ACGATTGATG CAACGTTGTG GGAGCGCACA GGCCATAATC CGGTGAAATT TTTGCGTGAA GTTAGCTTGG CTAGCCTCGA ACAAAAAGCC CTTGATAGCA ATTATTTGGC CCAGTATGAT CGGGTAACGA TCAAATTTGA TCATTATATG CGTGATGATC AAACGTGGTT TGCCCGCACC CATCCTGAGC TTAAAGATCA AACGATTGCC TATTTTTCGG CTGAATTTGG CATTCACGAG TCGTTGCCAA TTTACTCTGG TGGCTTGGGC ATTTTGTCGG GCGACCATTG TAAAGAAGCC AGCGATATTG GTTTGCCATT CGTTGGGGTT GGCTTTTTAT ATCCTCAAGG TTATTTTCGC CAATTTATCA ATAGCGAAGG CACCCAAGAA GCGCTCTACG AACGCTTGAA TTTTGCTGAG TTGCCAGCCT TGGCTGTGCG CAATAGCGAT GGCAGCGAAT TACGCGTGAG CGTCGATTTG CCCGGGCGCA TGGTGTTTGT CAAAGTTTGG CGCTTCCAGG TTGGCCGCAT CACGCTGCTG CTGATGGATA CCGATGTTGA CGAAAATCGC CCTGAAGATC GCGAATTGTC GGCTCGTTTG TATGGCGGCG ATCAATGGCT GCGGGTCGCT CAAGAAATCA TCTTGGGGAT TGCTGGCGTG CGAGTGTTAC GCGCTCTCAA TATCAATCCA ACAGTTTGGC ACATGAATGA AGGTCACTCA GCATTCTTGG GCTTGGAATT GTTGCGCGAA CGAGTTCAAC GTGGCGAAAA CCTTGAACAT GCCGTCAGCG AAGTGCGCAA ACGCTCAGTC TTTACCACCC ATACGCCAGT GCCTGCTGGC AACGATGCCT TCCCACTCGA TCTGATCGAT CAATTCTTCC ACAACTACTG GCCACAATTG GGCATTGATC GCGACACCTT TATGAATATT GCGTTGCAGC AACAAAGCTG GGGGCCAACC TTCAGCATGA CCGTGCTCGC ATTGCGCTTA TCGGAATATC ATAACGGCGT GAGCGAATTG CACGGCGCTG TGGCTCGTCA AATGTGGCAA TTCCTGTATC CAGGCAAATC AGTTGACGAA GTGCCAATTG GCCATGTGAC CAACGGCGTG CACTCGGATA CGTGGTTGGC TCCAGCCATG GCCAATATGT ACGACTCGGT GCTTGGCGCT GGTTGGCGCG AACGGATGGA CGACCCTGCA ACATGGGATC GCATCACGAC GATGCCCGAT GAAGTGCTTT GGGGCGTACA CTCAGGCCAA AAGCACGATT TGGCTCGCTT TATTCGCGAA CGCGCTTGGC AGCGCCAACG TCGTTTAGGC TACCATCGCG ACGATGCAAT TAGCGGCGAT GTTAACCCCA ACGCGTTGTT GATTGGCTTC GCACGCCGAT TTGCCACCTA CAAACGGGCG ACCTTGATTT TCCGCGATTT GGATCGGATC AAGCGGATTT TGAACAACGC TGAGCGTCCC GTTCAAATCA TTTTCGCTGG TAAATCGCAC CCAGCCGATG AGCCAGGCAA GAGCTTTATT CGCGCCGTTT ACAATATGGC TCGCGACCCA GAGCTGCGTG GCAAGATCTT CTTTATCGAA GAATATGATA TTAACGTTGG CCGCCATTTG GTACAAAGTG TTGACGTATG GTTGAACAAC CCACGCCGAC CATTGGAAGC ATCGGGAACC AGCGGCGAAA AAGCGGGCAT GAACGGTGTG CCAAACCTAA GCGTGCTTGA CGGCTGGTGG CGCGAAGCCT ACAACGGCAA GAACGGTTGG GCGATTGGGC GCGAAGCTGG CTACGACGAT CTTGAACAAC AAGATATTGA CGATGCTGAA TCGTTGTATA GCTTGTTGGA AAACGAAGTT ATTCCATCGT TCTATGATCG TGATGCCAAT GGCTTGCCCC ACAAATGGAT TGCGACCATG AAGGAATCGA TTCGCACGGT TGCCCCACAA TTTAGCTTCC GTCGCATGCT CAAAGATTAC GTAGCCCAAT ACTACGTGCC AGGCATGCAA GCCGAATAA
|
Protein sequence | MNVAELPPLF APLPQRINRL YELAYNLWWS WQPEAQALYQ TIDATLWERT GHNPVKFLRE VSLASLEQKA LDSNYLAQYD RVTIKFDHYM RDDQTWFART HPELKDQTIA YFSAEFGIHE SLPIYSGGLG ILSGDHCKEA SDIGLPFVGV GFLYPQGYFR QFINSEGTQE ALYERLNFAE LPALAVRNSD GSELRVSVDL PGRMVFVKVW RFQVGRITLL LMDTDVDENR PEDRELSARL YGGDQWLRVA QEIILGIAGV RVLRALNINP TVWHMNEGHS AFLGLELLRE RVQRGENLEH AVSEVRKRSV FTTHTPVPAG NDAFPLDLID QFFHNYWPQL GIDRDTFMNI ALQQQSWGPT FSMTVLALRL SEYHNGVSEL HGAVARQMWQ FLYPGKSVDE VPIGHVTNGV HSDTWLAPAM ANMYDSVLGA GWRERMDDPA TWDRITTMPD EVLWGVHSGQ KHDLARFIRE RAWQRQRRLG YHRDDAISGD VNPNALLIGF ARRFATYKRA TLIFRDLDRI KRILNNAERP VQIIFAGKSH PADEPGKSFI RAVYNMARDP ELRGKIFFIE EYDINVGRHL VQSVDVWLNN PRRPLEASGT SGEKAGMNGV PNLSVLDGWW REAYNGKNGW AIGREAGYDD LEQQDIDDAE SLYSLLENEV IPSFYDRDAN GLPHKWIATM KESIRTVAPQ FSFRRMLKDY VAQYYVPGMQ AE
|
| |