Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50500 |
Symbol | hoxV |
ID | 7763899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5117781 |
End bp | 5118827 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643807879 |
Product | hydrogenase expression/formation protein HoxV |
Protein accession | YP_002802113 |
Protein GI | 226947040 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.102569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGCCC TGGAGGCACT GGCCGGACGC CTGCACGTGG AGGTGCGCCT GCAGGACGGT GTCATCCGCG CCGTCGATAC CCGACTGCAG CGCCCGCTGC CGCAGATTTC CCGCCTGCTC GTCGGGCAGA CCGCGGAGGC GGCCCTGAGG CGCCTGCCGC TGCTGTTCGG CCTGTGCGCC GCGGCGCAGC AGGTGGCGGC ACTACGGGCG CTGGAGCGGG CCGCCGGCTG GGCGGCGATT GCCGAGGTGG AGGAGGGCCG CACCCGGCTC GGCGAACTGG AGTCGATCCG TGAGTCCCTG CTGCGCCTGG TGCAGGTCTG GGAGCTGCCT GTGCCCCTGG AGCGGCTCAA GGCGCTGCTC GCCCTGTGCC GGCGCGCCGC CGCCCGCCTG CAAGCGCTGA CCGCCTTTCG CGCCGCGCCG TTGCCGGCCG ATGCGGAGCT GGAGGGGACG CTGGCCGCGC TGGCCGCCGC CTGGGCCGAC CTGCAACCGC CGGCACCGGC CGACTGGCTG CGTCCGCGTC TCGACCGCTG GCAGGAAGTC GCGCTCGGTG GACCGCCACC GCAGGCCTTC ACAGCGGACG AATTGCCGGC GCTGCTGGCG CAATTGCGCG CCAGCGACGC GCGCGCCGAG ATCGCCGGGC AGCCGCGGCT CGGCGGGCCG GCCGCCAGCG CCGGGGCGCA GGCGACGGCG AGCGCGCAGA TCGAGCAGCA CGTCGGCGCG CTGCTGCGGC GCACGGCGCA GGCGATAGAC TCGCTGCAGT CGCCGCCAGC GCCGCCGGCC GTGGCCGGGC TGGCGGCGGG AGAGGGTGTC GGCCTGGCGC GGACCGCCCG CGGCTGGCTG CTGCACCGGG TGTGCCTGGA CGACGGGGCG GTCGGCACCT GGCAACTGCT GGCGCCGACC GACTGGAATT TCCATGCCGA CGGCCCGCTG CGCCGCCGGC TGTGCGGCGT GCGGGTGGCC GCCGGGGAGG TCGAGGCGCT GCTGCGCGAA CTGATCCTCG CGCTCGATCC CTGCGTCGCT TTCGAGGTGA AGATCGTCCA TGCATGA
|
Protein sequence | MSALEALAGR LHVEVRLQDG VIRAVDTRLQ RPLPQISRLL VGQTAEAALR RLPLLFGLCA AAQQVAALRA LERAAGWAAI AEVEEGRTRL GELESIRESL LRLVQVWELP VPLERLKALL ALCRRAAARL QALTAFRAAP LPADAELEGT LAALAAAWAD LQPPAPADWL RPRLDRWQEV ALGGPPPQAF TADELPALLA QLRASDARAE IAGQPRLGGP AASAGAQATA SAQIEQHVGA LLRRTAQAID SLQSPPAPPA VAGLAAGEGV GLARTARGWL LHRVCLDDGA VGTWQLLAPT DWNFHADGPL RRRLCGVRVA AGEVEALLRE LILALDPCVA FEVKIVHA
|
| |