Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0421 |
Symbol | |
ID | 5732320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 492687 |
End bp | 493778 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277547 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_001543200 |
Protein GI | 159896953 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000245782 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGCTT TTTGGCATAC CTATCAACGG GTCAAACATG GCAGTTGGAG CTTGGTTTTG GGCTTGGCTT TGGTAGCTTG CTCAATTCCA GTCAATTACT CCTCGGATAA TCCACCAACT AGCCCACCAG CCGAACTTCA AATCGCAGCA GGATTCACAA TTACCCCCAT TATTACTGGC CTTGATCAGC CAACTGGACT AGCCTTTGAT CCCCAAGGGC GTTTGTATAT TGCCCAACGC TCAGGCACTG TGCTGATCAG CGACGGAACC AACCCACCCC GCGAGTACGC AACTGATTTC AATAAGCCCT TGAGTTTGCA TTGGTATGAG CAGGCGTTGT ATGTGGTTGA GCAAGGCCAA ATTCAGCGCT TGAGCGACGG CAATAGCGAT GGGATCGTCG ATCAGCAAAC GGTTTTATTG AGCGATCTGC CTGATCAAAT TGAGGCGGCC ACGCTGGTTT CCGATGCGCA GGGTTGGCTC TATCTCGGTG TAGGAACACG TGCTGATCAT ACGGCAACCG CCGGAATTCA AGAGGGTTAT ATTCGGCGTT TTCGCGCCGA TGGCAGCGAA TCGAGCGTGT ATGCAACTGG TTTACGCATG CCGTTTGGCC TAGCCTTCGA CCCACAGCAA CAGCTTTATG CCACCGATAA CGGGCGCGAA GGCCTTGGCG ATGATCTTCC TCCCGATGAG GTAAATCGAA TCACGGCTGG AGCCGATTAT GGTTGGCCGC GCTGTTGGGG CCAGCGCCAG CCCGATGCTG ATGGCGGCGG TGATGCGGCC AACTGTTCGA CCACCAACGA GCCAGTTGCG TTGTTGCCTG CCCATAGCGG TGTAACCGGA ATTGTCTTCT ATCAGGGCAC AGCGTTTCCC GCCAGATATC ACGGCGATGC CTTCATTGGC TTATCCGGTT CATGGTATAG CCAAGAATTA CGGGGCCATA GCATCGTCCG CATGGATGCC GAAACACAGC AGATTGAGTC ATTCGCCAGT GGTTTTGGCC GACCTGTTGG TTTAGCGATT AGCCCGCAAG GCCAATTGTT GGTCGCTGAT TATGACCGTG GCACCATCGT AATGATTGCC GCCCCACCTT AA
|
Protein sequence | MLAFWHTYQR VKHGSWSLVL GLALVACSIP VNYSSDNPPT SPPAELQIAA GFTITPIITG LDQPTGLAFD PQGRLYIAQR SGTVLISDGT NPPREYATDF NKPLSLHWYE QALYVVEQGQ IQRLSDGNSD GIVDQQTVLL SDLPDQIEAA TLVSDAQGWL YLGVGTRADH TATAGIQEGY IRRFRADGSE SSVYATGLRM PFGLAFDPQQ QLYATDNGRE GLGDDLPPDE VNRITAGADY GWPRCWGQRQ PDADGGGDAA NCSTTNEPVA LLPAHSGVTG IVFYQGTAFP ARYHGDAFIG LSGSWYSQEL RGHSIVRMDA ETQQIESFAS GFGRPVGLAI SPQGQLLVAD YDRGTIVMIA APP
|
| |