Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3122 |
Symbol | |
ID | 5734994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3939764 |
End bp | 3941020 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280265 |
Product | L-sorbosone dehydrogenase |
Protein accession | YP_001545887 |
Protein GI | 159899640 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAAG CTCTGCTTTT GATCAGTTTG CTTGGTTTGT TGGCCGCTTG TGGCGAGCAA CAAGTCGATG CGCCGATTGC ACCAACCACC GTGCCGCTCA ACCAAGCCGC TGGCAGTTCC ACCACACCTG TTGCCACCGC AACAACTAGT ATTGTACCAA GCCCTGAGCC AACAACAAGT GCAGGCAAGC CAGCACGCAA CGCGCCAACC GCCGTAGTCG AGCCAACCGA TGCGGTTACT TTGCCCGATG GCTTTGGCAT TAGTGTGTTT CAAAGTAAGC TGGCTGGCCC GCGCATGTTG GCGATTGGCC CTGATGGCGC GATTTATACC GCTGAGCGTG GGGAGGATCG GATTGTCCGC TTGCCTGATC GCAACGCCGA TGGTTTGGCT GATGGCGTTG AGGTGATTGC TGATGGCTTC GATTCACCCT CAAGCATGAT TTTCGACCAA GCTGGAAATT TATATGTCGC CGAAACCACC AAAGTGATCA AATTAACCCA GCCTGATGCT GAAGGCAAAT ATACTCAACG CCAAACGATC ATCGATGGCT TGCCTGCTGG CGGCCATAGC ACCCGCACCT TGCTATTCAG CCCTGATGAA AGCAAATTGT ATGTGGCGGT TGGTTCATCG TGCAATGTTT GCAACGAAGA AGATGAGCGA CGGGCAACCG TGATGGAATA TGATCCCGAT GGCAGCAATG GCCGAATTTA TGCCAAGGGC TTACGCAACG CGGTGGGCAT TACTTGGCGG CCTGGCACGA ATGAATTGTG GGCTACCAAC AATGGTCGCG ACATGTTGGG CGACGACCAA CCACCAGAAA CTGTCAACGT GGCAACCAGC GCTGGCCTGG ATTTTGGCTG GCCTCGCTGT CACTCAGGGC GGATTGCCGA CCCTGAATTT GGCAAAGATG CCAATGCCTG CCAAGGTGTT ACGCCGCCTG CGGTCGAGAT GCAAGCCCAC AGCGCTCCGC TCGGTTTGGC ATTTGGCAAC GGCAGCAACT TCCCCGAACC CTATCAAAGC GGCTTGTTTG TGGCTTTCCA CGGCTCATGG AATCGCTCAA GCCCAACGGG TTATAAAGTG GTGTTTATTC CCGTAACTGA TGGCAAAGCT GGCAATGCCC AAGATTTTGC CACTGGCTGG CTGACCGATG CTGGAGCGGT TTGGGGCCGA CCAGTTGATG TAATTGTGGG CCGTGACGGT AGTTTATATA TTTCCGATGA CGCTGGCGGC GCGATTTACC GCGTCTTTGC CAAATAA
|
Protein sequence | MRKALLLISL LGLLAACGEQ QVDAPIAPTT VPLNQAAGSS TTPVATATTS IVPSPEPTTS AGKPARNAPT AVVEPTDAVT LPDGFGISVF QSKLAGPRML AIGPDGAIYT AERGEDRIVR LPDRNADGLA DGVEVIADGF DSPSSMIFDQ AGNLYVAETT KVIKLTQPDA EGKYTQRQTI IDGLPAGGHS TRTLLFSPDE SKLYVAVGSS CNVCNEEDER RATVMEYDPD GSNGRIYAKG LRNAVGITWR PGTNELWATN NGRDMLGDDQ PPETVNVATS AGLDFGWPRC HSGRIADPEF GKDANACQGV TPPAVEMQAH SAPLGLAFGN GSNFPEPYQS GLFVAFHGSW NRSSPTGYKV VFIPVTDGKA GNAQDFATGW LTDAGAVWGR PVDVIVGRDG SLYISDDAGG AIYRVFAK
|
| |