Gene Information |
Locus tag | Haur_3737 |
Symbol | |
ID | 5735601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4695939 |
End bp | 4697339 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280889 |
Product | glucose/sorbosone dehydrogenase-like |
Protein accession | YP_001546501 |
Protein GI | 159900254 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
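The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4695939
end_bp = 4697339
gene_length_bp = 1401
protein_length_aa = 466


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def sense_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = sense_codons(computed_length)

print(computed_length == gene_length_bp)      # coordinates vs. Gene Length
print(computed_protein == protein_length_aa)  # Gene Length vs. Protein Length
```

Under these assumptions, 4697339 - 4695939 + 1 gives 1401 bp, and 1401 / 3 - 1 gives 466 codons, matching the Gene Length and Protein Length fields.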
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCTGA ATATTGCAAC CTTGGTGCTG CTATCACTTT GGCTGGGGGC GATGATTGGC GGATTATTGC TTATGCGCCA AGCAGGCCGT TGGCGCTGGA TTGGCCTCGC AATTGCCGTG CAAGGCTTGA TCTTGCTGGT ATTTCAAGGC TTGCTCAGCA CTTATCCAAT TGGGATTGGC GATCCATTAA TTCGGGTGAA ATTTTGGGAT TTGCTGAGCC AAGCGGCGGC CGTGCTTGGT GCATTAGTGC TTGTCGCATT TGTGTTATGG TGGCTAGCCA AACAAGTGCG CTACAGCATC GATCAGCAAT TGCACCGACG AGGCTTAGCC TTATTGTTGC TCTTTGGCAT GCCATTAATC GCCACGCCCG CCTTTTACGC CATGTGGCAA ACCAGCATTC CCGAGCGCGA GCGTGAACGC AACCCTGATT TGCGGGTGAT CAACGTGCCC GAAGGCTTTG AATGGAGCGT CTACGCCCGT GGCACGATGG ATAATCCTAC GGCGATTGCC TTTGGAGCAG AAAACGAACT ATACATCGCC GATATTGCTG GCGATTTGTG GATTGGCCGC GACCAAAATA ATGATCAGCA GATTGATAGT TTGAGCAAAT GGGCTGGCGA TTTTGATTTG TTGGTCGGCT TGGTTTGGCG CGATGGCGAG TTGTATTGTG CTAGTTCAGG CAAAATCGAG GCCTTGCGCG ATAGCGATGG CGATGGCGTA GCCGATAGTC GCCGCATCGT GGTTGATAAT TTGCCCTCAA TGATTTTGCA GCCACACTCC AACAACGGCT TGGCCTTTGG CCCCGATGGC CGCTTGTATT TTGGGGTTGG CAGCACCACC GACGGTAAAT TTGAGGAAAA TGAATTAGCC GCCAGCGTTT TATCGGTTAA TCCCGATGGC ACTGATTTGC GCCCCTATGC GCGGGGCTTG GGCAATGTGT TTGATGTCGC TTTCAATGCC GATGGGGCGC TGTTTGGGGG CGATAATGGC CCTAGCTCAG TTGAGGGCAA TGATCCGCCA GATGAATTTA ATTATTTGGT TGAAGGCGAA CACTACGGCT ACCCCTACTT TTTTGGCGAC CCGCCCAGCG ACGGTGGCAC ACGCGGCGCA TTGATCAGCT TTCCCGCTCA CTCGGTGCCA ACGGGCGTTA CGTTTTATAG TGGCAATCAA TATCCGCAAA TCTATAGCGA TAGCGCCTTT TTGACGCTGT GGCAGACGGG CGAGGTCGTA CACATTGAAG TTGGGCAAAC CAGCAACGGC GATTATCTGG CCAAATCAAC CACCTTTGCT GATGGTATGC TCTACCCGAT TGATGTGATT ACTGGCCCCG ATGGCAACTT GTATATCGCC GATTTCGGTA CGAGTGCAAT CTACCGAATC ACCTATAATG GAGTGCGTTA A
|
Protein sequence | MLLNIATLVL LSLWLGAMIG GLLLMRQAGR WRWIGLAIAV QGLILLVFQG LLSTYPIGIG DPLIRVKFWD LLSQAAAVLG ALVLVAFVLW WLAKQVRYSI DQQLHRRGLA LLLLFGMPLI ATPAFYAMWQ TSIPERERER NPDLRVINVP EGFEWSVYAR GTMDNPTAIA FGAENELYIA DIAGDLWIGR DQNNDQQIDS LSKWAGDFDL LVGLVWRDGE LYCASSGKIE ALRDSDGDGV ADSRRIVVDN LPSMILQPHS NNGLAFGPDG RLYFGVGSTT DGKFEENELA ASVLSVNPDG TDLRPYARGL GNVFDVAFNA DGALFGGDNG PSSVEGNDPP DEFNYLVEGE HYGYPYFFGD PPSDGGTRGA LISFPAHSVP TGVTFYSGNQ YPQIYSDSAF LTLWQTGEVV HIEVGQTSNG DYLAKSTTFA DGMLYPIDVI TGPDGNLYIA DFGTSAIYRI TYNGVR
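The two sequence fields can be compared by translating the gene sequence and checking it against the protein sequence. The following is a minimal sketch, not the tool that produced this record: it assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA even though the gene is on the minus strand), strips the spaces that separate the 10-character blocks, and uses the standard codon assignments, which translation table 11 shares with table 1 apart from its alternative start codons. Only the first three blocks of the gene sequence are used here; the helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def clean(seq):
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGCTGCTGA ATATTGCAAC CTTGGTGCTG"
peptide = translate(first_blocks)
print(peptide)
```

Passing the full Gene sequence field to `translate` and `gc_percent` in the same way would allow a comparison with the Protein sequence and GC content fields.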