Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4700 |
Symbol | |
ID | 5736547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6004945 |
End bp | 6005970 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281864 |
Product | glyceraldehyde-3-phosphate dehydrogenase, type I |
Protein accession | YP_001547459 |
Protein GI | 159901212 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCACGAG TAGCAATCAA CGGCTTCGGT CGCATCGGTC GCCAAAGCTT CAAAACCATC TTGGATCACT ATCGCGATGA GTTGGAAATT GTTGCAATCA ACGACCTGAC CGATAACCAA ACGTTGGCCC ACCTGCTCAA GTACGATTCA ACCTATGGTT CATTCGATGG CGATGTTACC GCCACTGAAG ATACAATCAG CGTTACCTTC GCCGATGAAG AAGAGCCAAT GGTCATCAAG GCCTTGGCCG AGCGCGATCC GTCAAAATTA CCATGGGGTG AGCTCAACGT CGATATCGTA ATCGAATCGA CTGGGATTTT TACTGATGCG ACCAAAGCCA AAATGCACTT AGATGCAGGC GCGAAGAAAG TCATCATCAG TGCTCCCGCC AAGAACGAAG ACATCACCGT GTGTATGGGC GTGAACGAAG AAAATTACGA CGCTGAAAAA CATACGATTG TCTCGAATGC ATCATGTACC ACCAACTGTT TGGCTCCAGT AGCTAAAGTG TTGAACGATA AGTTCGGCAT TGTGCGTGGC TTGATGACCA CCATTCACTC ATACACGATG GACCAAAATT TGCAAGACGG CCCCCACAAA GATTTGCGCC GCGCTCGCGC TGCTGCTTTG AACATGGTTC CAACCACCAC GGGTGCTGCT AAAGCTGTAG CTTTGGTTAT CCCTGAGTTG AAGGGCAAAT TCGATGGTTT CGCCGTGCGC GTCCCAACCC CAACCGTTTC GATGGTTGAC TTTGTGGTTG AATTGGCCGA AGGCGCAACC GTTGAATCGA TCAACAATGC CTTTATTGAA GCCTCAGAAG GCGCAATGGA AGGCGTGTTG GGCGTAACCA ATGAAGAACT GGTCTCATCA GACTTCATCG GTACTTCATA CTCAAGCGTG GTTGACTTAA GCTTGACCAT GGCAATGGGC GACAAGATGG TCAAAGTTGT TGCTTGGTAC GACAACGAAT GGGGCTATGC TACCCGCATC GCTGACCTCA CCGCCTATAT CGCTGAAAAA CTCTAA
|
Protein sequence | MARVAINGFG RIGRQSFKTI LDHYRDELEI VAINDLTDNQ TLAHLLKYDS TYGSFDGDVT ATEDTISVTF ADEEEPMVIK ALAERDPSKL PWGELNVDIV IESTGIFTDA TKAKMHLDAG AKKVIISAPA KNEDITVCMG VNEENYDAEK HTIVSNASCT TNCLAPVAKV LNDKFGIVRG LMTTIHSYTM DQNLQDGPHK DLRRARAAAL NMVPTTTGAA KAVALVIPEL KGKFDGFAVR VPTPTVSMVD FVVELAEGAT VESINNAFIE ASEGAMEGVL GVTNEELVSS DFIGTSYSSV VDLSLTMAMG DKMVKVVAWY DNEWGYATRI ADLTAYIAEK L
|
| |