Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50590 |
Symbol | hoxK |
ID | 7763908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5124351 |
End bp | 5125427 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643807888 |
Product | Uptake hydrogenase small subunit (Precursor), HoxK |
Protein accession | YP_002802122 |
Protein GI | 226947049 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGAC TCGAAACTTT CTATGACGTG ATGCGGCGTC AGGGCATCAC GCGCCGCAGC TTTCTCAAAT ATTGCAGCCT GACCGCCGCG GCCCTGGGCC TCGGCCCGGC CTTCGCCCCG CGGATCGCCC ACGCGATGGA AACCAAGCCG CGCACTCCGG TGCTCTGGCT GCACGGCCTG GAGTGCACCT GCTGCTCCGA GTCGTTCATC CGTTCGGCCC ACCCGCTGGT CAAGGACGTG GTGCTGTCGA TGATCTCGCT GGACTACGAC GACACCCTGA TGGCCGCCGC CGGCCACCAG GCCGAGGCCG CCCTCGAAGA GACCATGCGC AAGTACAAGG GCGAGTACAT CCTCGCCGTG GAGGGCAACC CGCCGCTCAA CGAGGACGGC ATGTTCTGCA TCGTCGGCGG CAAGCCGTTC ATCGAGCAGC TCAGGCATGT GGCGAAGGAC GCCAAGGCGG TGATCGCCTG GGGCAGTTGC GCCAGTTGGG GCTGCGTGCA GGCGGCCCGG CCCAACCCGA CCCAGGCGGT GCCGATCCAC AAGGTCATCA CCGACAAGCC GATCGTCAAG GTGCCCGGCT GCCCGCCGAT CGCCGAGGTG ATGACCGGGG TGATCACCTA CATGCTGACC TTCGGCAAGC TGCCCGAGCT GGACCGCCAG GGGCGGCCGA AGATGTTCTA CGGCCAGCGC ATCCACGACA AGTGCTACCG CCGCCCGCAC TTCGACGCCG GCCAGTTCGT CGAGCACTGG GACGACGAGG GCGCGCGCAA GGGCTACTGC CTGTACAAGG TCGGCTGCAA GGGCCCGACC AGCTACAACG CCTGCTCGAC GGTGCGCTGG AACGAGGGCA CTTCCTTCCC GATCCAGGCC GGCCACGGCT GCATCGGCTG CTCGGAGGAC GGTTTCTGGG ACAAGGGCTC GTTCTATGAA CGCCTGACCA CCATTCCGCA GTTCGGCATC GAGAAGAACG CCGACGAAAT CGGCGCCGCC GTCGCCGGCG GGGTCGGCGC GGCCATCGCC GCGCATGCCG CGGTCACCGC CATCAAGCGC CTGCAGAACA AGGGGGATCG CCCATGA
|
Protein sequence | MSRLETFYDV MRRQGITRRS FLKYCSLTAA ALGLGPAFAP RIAHAMETKP RTPVLWLHGL ECTCCSESFI RSAHPLVKDV VLSMISLDYD DTLMAAAGHQ AEAALEETMR KYKGEYILAV EGNPPLNEDG MFCIVGGKPF IEQLRHVAKD AKAVIAWGSC ASWGCVQAAR PNPTQAVPIH KVITDKPIVK VPGCPPIAEV MTGVITYMLT FGKLPELDRQ GRPKMFYGQR IHDKCYRRPH FDAGQFVEHW DDEGARKGYC LYKVGCKGPT SYNACSTVRW NEGTSFPIQA GHGCIGCSED GFWDKGSFYE RLTTIPQFGI EKNADEIGAA VAGGVGAAIA AHAAVTAIKR LQNKGDRP
|
| |