Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_14580 |
Symbol | gapB |
ID | 7760394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1439606 |
End bp | 1440991 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643804356 |
Product | glyceraldehyde-3-phosphate dehydrogenase |
Protein accession | YP_002798649 |
Protein GI | 226943576 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0654455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCCGC TGATCGGTCA GCTCTATCGC AACAACAACG TGGTGACCTC GATTTTCGGA CGTCCCCTGA TCAACCAGTC CGTGATCGCG GTCCTCAAGG CCCATCGCTT CGCCCGCCAC CGCCAGACCG ACGAGGCCGA GCTTTCGGTG CACGAGACCT TCCCGATCCT CAAGGCCATG AGCGAGCTGA AGCTCGGCGC CGCTTCCGTC GATCTGGGCA AGATGGTCGG CAAGTACCGG GAGGCCGGTG CCGGTCGCTC CGTCGAGCAG TTCGTGCGCG ACGAGCTGTC CGAGGTCGTC GGCAGGCAGA ACAGCATCCA GCACAAGGGC AGCGACGTGG TGCTCTATGG CTTCGGCCGC ATCGGCCGCC TGCTGGCGCG CATCCTGATC CAGAAGACCG GCGGCGGCGA CGGACTGCGC CTGCGCGCCA TCGTCGTGCG CAAGGGCGCC GACAACGATC TGGCCAAGCG CGCCAGCCTG TTGCGCCGTG ACTCGGTGCA CGGCCCGTTC GAAGGCACCA TCACCGTGGA CGAGGAAAGC AATACCATCC TCGCCAACGG CAACCTGATC CAGGTGATCT ACTCCAACGA CCCGACCTCC GTCGATTACA CCCAGTACGG CATCCAGAGC GCCATCGTGG TCGACAACAC CGGCAAGTGG CGCGACGCCG AAGGCCTCGG CCAGCATCTG AAGTGCCCGG GCGTCGCCCG CGTGGTACTG ACCGCTCCGG GCAAGGGGGC GCTGAAGAAC ATCGTGCATG GCATCAACCA TCGCGACATC ACCGACGAGG ACAAGATCGT CTCCGCGGCT TCCTGCACCA CCAATGCCAT CGTGCCGGTG CTGAAGGCGG TGAACGACAA GTTCGGCATC GTCAATGGTC ACGTGGAGAC CGTGCACTCC TACACCAACG ACCAGAACCT GATCGACAAC TTCCACAAGG GCAGCCGCCG CGGTCGAAGC GCTCCGCTGA ACATGGTGAT CACCGAGACC GGGGCGGCCA CCGCCGCGGC CAAGGCCCTG CCCGAGTTGA AGGGCAAGCT GACCGGCAAC GCGATTCGCG TGCCGACGCC GAACGTTTCC ATGGCGATCC TCAACCTGAA CCTGGAAAAG TCCACCAGCC GCGACGAGAT CAACGAATAC CTGCGTCAGG TGGCGATGTT CTCCGAACTG CACAAGCAGA TCGACTTCGT CAACTCGTCC GAGGTGGTAT CCACCGACTT CGTCGGTTCC CGCCATGCCG GGGTGGTGGA TGCCGAAGCC ACCATCTGCA CCGACAATCG AGTCGTGCTC TATGTCTGGT ACGACAACGA GTTCGGCTAC AGTTGCCAGG TGGTACGCGT GCTCGAGGAA ATAGCCGGAG TCAATCCGCC GGTTTTCCCC AACTGA
|
Protein sequence | MIPLIGQLYR NNNVVTSIFG RPLINQSVIA VLKAHRFARH RQTDEAELSV HETFPILKAM SELKLGAASV DLGKMVGKYR EAGAGRSVEQ FVRDELSEVV GRQNSIQHKG SDVVLYGFGR IGRLLARILI QKTGGGDGLR LRAIVVRKGA DNDLAKRASL LRRDSVHGPF EGTITVDEES NTILANGNLI QVIYSNDPTS VDYTQYGIQS AIVVDNTGKW RDAEGLGQHL KCPGVARVVL TAPGKGALKN IVHGINHRDI TDEDKIVSAA SCTTNAIVPV LKAVNDKFGI VNGHVETVHS YTNDQNLIDN FHKGSRRGRS APLNMVITET GAATAAAKAL PELKGKLTGN AIRVPTPNVS MAILNLNLEK STSRDEINEY LRQVAMFSEL HKQIDFVNSS EVVSTDFVGS RHAGVVDAEA TICTDNRVVL YVWYDNEFGY SCQVVRVLEE IAGVNPPVFP N
|
| |