Gene Avin_50590 details

Gene Information

Locus tag: Avin_50590
Symbol: hoxK
ID: 7763908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5124351
End bp: 5125427
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 67%
IMG OID: 643807888
Product: Uptake hydrogenase small subunit (Precursor), HoxK
Protein accession: YP_002802122
Protein GI: 226947049
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
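
The coordinate and length fields of this record agree with each other, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon while the protein length does not; the record states neither convention, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed to be counted in the CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Avin_50590.
start_bp, end_bp = 5124351, 5125427
gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1077 358
```

Under these assumptions the computed values match the listed gene length of 1077 bp and protein length of 358 aa.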


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCGAC TCGAAACTTT CTATGACGTG ATGCGGCGTC AGGGCATCAC GCGCCGCAGC 
TTTCTCAAAT ATTGCAGCCT GACCGCCGCG GCCCTGGGCC TCGGCCCGGC CTTCGCCCCG
CGGATCGCCC ACGCGATGGA AACCAAGCCG CGCACTCCGG TGCTCTGGCT GCACGGCCTG
GAGTGCACCT GCTGCTCCGA GTCGTTCATC CGTTCGGCCC ACCCGCTGGT CAAGGACGTG
GTGCTGTCGA TGATCTCGCT GGACTACGAC GACACCCTGA TGGCCGCCGC CGGCCACCAG
GCCGAGGCCG CCCTCGAAGA GACCATGCGC AAGTACAAGG GCGAGTACAT CCTCGCCGTG
GAGGGCAACC CGCCGCTCAA CGAGGACGGC ATGTTCTGCA TCGTCGGCGG CAAGCCGTTC
ATCGAGCAGC TCAGGCATGT GGCGAAGGAC GCCAAGGCGG TGATCGCCTG GGGCAGTTGC
GCCAGTTGGG GCTGCGTGCA GGCGGCCCGG CCCAACCCGA CCCAGGCGGT GCCGATCCAC
AAGGTCATCA CCGACAAGCC GATCGTCAAG GTGCCCGGCT GCCCGCCGAT CGCCGAGGTG
ATGACCGGGG TGATCACCTA CATGCTGACC TTCGGCAAGC TGCCCGAGCT GGACCGCCAG
GGGCGGCCGA AGATGTTCTA CGGCCAGCGC ATCCACGACA AGTGCTACCG CCGCCCGCAC
TTCGACGCCG GCCAGTTCGT CGAGCACTGG GACGACGAGG GCGCGCGCAA GGGCTACTGC
CTGTACAAGG TCGGCTGCAA GGGCCCGACC AGCTACAACG CCTGCTCGAC GGTGCGCTGG
AACGAGGGCA CTTCCTTCCC GATCCAGGCC GGCCACGGCT GCATCGGCTG CTCGGAGGAC
GGTTTCTGGG ACAAGGGCTC GTTCTATGAA CGCCTGACCA CCATTCCGCA GTTCGGCATC
GAGAAGAACG CCGACGAAAT CGGCGCCGCC GTCGCCGGCG GGGTCGGCGC GGCCATCGCC
GCGCATGCCG CGGTCACCGC CATCAAGCGC CTGCAGAACA AGGGGGATCG CCCATGA
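
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any calculation. A minimal sketch of that clean-up, together with a GC-content calculation, is shown here; the helper names are hypothetical and only the first display line is used as an excerpt, so the printed percentage describes that excerpt rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, used here only as a short excerpt.
excerpt = "ATGTCTCGAC TCGAAACTTT CTATGACGTG ATGCGGCGTC AGGGCATCAC GCGCCGCAGC"
seq = clean_sequence(excerpt)
print(len(seq), f"{gc_content(seq):.0%}")
```

Passing the full 1077-base sequence to `gc_content` gives a figure that can be compared with the GC content field of the record.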
 
Protein sequence
MSRLETFYDV MRRQGITRRS FLKYCSLTAA ALGLGPAFAP RIAHAMETKP RTPVLWLHGL 
ECTCCSESFI RSAHPLVKDV VLSMISLDYD DTLMAAAGHQ AEAALEETMR KYKGEYILAV
EGNPPLNEDG MFCIVGGKPF IEQLRHVAKD AKAVIAWGSC ASWGCVQAAR PNPTQAVPIH
KVITDKPIVK VPGCPPIAEV MTGVITYMLT FGKLPELDRQ GRPKMFYGQR IHDKCYRRPH
FDAGQFVEHW DDEGARKGYC LYKVGCKGPT SYNACSTVRW NEGTSFPIQA GHGCIGCSED
GFWDKGSFYE RLTTIPQFGI EKNADEIGAA VAGGVGAAIA AHAAVTAIKR LQNKGDRP
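
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do this in plain Python, not the method used to generate this record. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code (the two tables differ in which start codons are permitted), and it assumes the sequence is given in the coding orientation. The `translate` helper is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are shared by the standard code (table 1) and the bacterial code (table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGTCTCGAC TCGAAACTTT CTATGACGTG"))  # MSRLETFYDV
print(translate("AAGGGGGATC GCCCATGA"))               # KGDRP
```

The two excerpts translate to the first ten and last five residues of the protein sequence listed above.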