Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3022 |
Symbol | |
ID | 5734879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3817801 |
End bp | 3818913 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280166 |
Product | glucose-6-P dehydrogenase subunit-like |
Protein accession | YP_001545788 |
Protein GI | 159899541 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3429] Glucose-6-P dehydrogenase subunit |
TIGRFAM ID | [TIGR00534] opcA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGCG AAATTTCCAA TAGCCCACGT CCAGCCAATG TTGCTGAGAT TGAAGATCAA CTGCGCGATT TGTGGCGTGA ATTGGGCGAT CAGCATCGCG ATGAACATTA TGTGATGACC CGCGTTTGCA CCATGACTGT GATTGGCTAT GGTGCAAACC AAACCCTCGC CAAACGAGTA CGCACGGCCT TGCCGCAAGT GTTCGGCGTG CATCCATGCC GCGCAATTTT GATCGAAACT GGCAGCGAGG CCGAAGAACT CAGCGCATGG GTCAGCACCG TTTGTCAACC AAGTAGCGAA GAACACGAGC AAGTTTGTTG CGAACAAATT ACCTTCTGTG TTGGCGAGCA AATGCGGCGA CGTTTGCCCG GCACAGTGCT ACCTTTGGTC GTTTCCGATT TGCCCTTGTT TGTCTGGTGG CCTGGGCCGT TGCACCCTGC TAGCAATGTG CGTACCCAGT TGTTTGCTCA TGCTGATCGC TGGATTATGG ATTCGGCGGA TTTTCTCGAC CCGCTGCCCG ATTTGGCGCG TTTGCACAAG ATGGTTATCA GCGATCAAAC CGATGCGGTC AGCGATTTGA CTTGGGCACG CTTGACCCCG TGGCGCACTG CGTTTGCCCA AATTTTCGAT GCAATGGCCA TGCGCCCAGT GCTCGAAAAC CTTAGCAACA TCAAAGTAAC CACCGGACGA CATCAAGCAG CTGGCTTGCT GAGCATCGGC TGGATGGCAA CCTGCCTTGA TTGGCAGTTG ATCAGCGCGA GTGGCAATAG CGAAACCTTG CTTTGCAATT TCAAACATCG CAATGGCACG GTTACGATCA GCATGCAAAC CAGCGATCGA ACTGGCGAGG AAGTGCCATT TATCGAAGTA CAGGCCGCCA AACATAAAGC CAGTATCACC GTAGGCCGCA GCAGCAACAA ACAAGCCTTG CTAGCCAATG TGCATCTTGA TGGTGAAACT CGTTGTCAGA TGAGCGAACT ACCAAGGCCC AGCGATAGCC TGTTGCTCTT GAATGAACTA AATATGTATA GTCATGATCG AGTTTATGAA CGGGCTTTAG CGATTGTCGC CGCAATTGCT CAAGCAACCG ATCCCCATGG AGCACATGTA TGA
|
Protein sequence | MMGEISNSPR PANVAEIEDQ LRDLWRELGD QHRDEHYVMT RVCTMTVIGY GANQTLAKRV RTALPQVFGV HPCRAILIET GSEAEELSAW VSTVCQPSSE EHEQVCCEQI TFCVGEQMRR RLPGTVLPLV VSDLPLFVWW PGPLHPASNV RTQLFAHADR WIMDSADFLD PLPDLARLHK MVISDQTDAV SDLTWARLTP WRTAFAQIFD AMAMRPVLEN LSNIKVTTGR HQAAGLLSIG WMATCLDWQL ISASGNSETL LCNFKHRNGT VTISMQTSDR TGEEVPFIEV QAAKHKASIT VGRSSNKQAL LANVHLDGET RCQMSELPRP SDSLLLLNEL NMYSHDRVYE RALAIVAAIA QATDPHGAHV
|
| |