Gene Avin_04400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_04400 
SymbolhoxH 
ID7759399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp417280 
End bp418539 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content70% 
IMG OID643803361 
ProductSoluble nickel-dependent hydrogenase, large subunit, HoxH 
Protein accessionYP_002797671 
Protein GI226942598 
COG category[C] Energy production and conversion 
COG ID[COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGCG TCGAGGGCGA GGGCTCCCTC GACCTGCACA TCGAAGGCGA CCGGGTCGTC 
GCGGCGCGGC TCGGCATCTT CGAGCCGCCG CGCTTCTTCG AGGCCTTCCT GCGCGGGCGC
GGCCATGCCG AGGTGGCGGA CATGGTGGCG CGGATCTGTG GCATCTGCCC GGTGGCCTAC
CAGATGAGCG CGGTGCACGC CCTGGAAAAC GCCTTCGGCG TGCGGGTCGA GGGGCAATTG
CGCGCGCTGC GCCGGTTGCT CTACTGCGGC GAGTGGATCG AAAGCCACGC GCTGCATGTG
GTGATGCTGC ACGCCCCGGA CTTTCTCGGC TATCCGGACG CGATCCGCAT GGCGGCCGGG
CACGGCGACC GGGTGCGCGA CGCCCTGGCG CTGAAGAAGG CCGGCAACTC GATAATCCGC
CTGCTCGGCG GGCGCGAGAT CCACCCGGTC AACGTCCGGG TCGGCGGCTT CTACCGCGTG
CCGAGCCGCG CCGAGCTGGC GCCGCTGGCC GAGGAACTGG ATCGGGCCCG CGACATCGCC
GTCGGGCTGG TGCGCTGGGT GGCGGGCTTT CCCTTTCCGC ACATCGAACG GGACTACGAG
TTCGTCGCCC TGCGCCATCC GCACGAATAC CCGCTCAACG AGGGACGGCT GGTATCCAGC
CGCGGCCTCG ATATCGACAT CGCCGATTAC GAGACGGAGT TCGAGGAGCG CCAAGTGCCG
CACTCGACGG CGCTGCACTC GCATCTCAAG CGCCGTGGCG CCTATCTGGT CGGGCCGTTG
GCGCGCTACG CGCTGAACTT CGACCGATTG CCGGAACATA TCCGGGCGCT CGCCGGCGAG
GTCGGCCTCG GTCCGCTGTG CCGCAATCCG TTCCAGAGCA TCGTCGTGCG CGCCCTGGAA
ATCCTCTACG CCTGCGAGGA GGCGCTGGCC ATCATCGCCG CCTACCGGCC GCCGGACATG
GCCTGCGTGC CGCTGGAGCC ACGCGCCGCG ACAGGTTTTG GCTGCACCGA GGCGCCGCGC
GGCACCCTCT GGCACCGTTA TGAACTGTCC GCCGACGGTT CCGTCGAGGC CGCGCGCATC
GTCCCGCCCA CCGCGCAGAA CCAGCCGAGC ATCGAGGCGG ACCTGGAGGC GGTCGCCACG
TCGCTGCTCG ACCAGCCGGA GGAGGTCATC CGTCGGCGCT GCGAGCTGAG CATCCGCAAC
CACGACCCCT GCATCTCCTG CGCGACCCAC TTTCTCAAGC TGTCCGTGCA CCGCGCATGA
 
Protein sequence
MARVEGEGSL DLHIEGDRVV AARLGIFEPP RFFEAFLRGR GHAEVADMVA RICGICPVAY 
QMSAVHALEN AFGVRVEGQL RALRRLLYCG EWIESHALHV VMLHAPDFLG YPDAIRMAAG
HGDRVRDALA LKKAGNSIIR LLGGREIHPV NVRVGGFYRV PSRAELAPLA EELDRARDIA
VGLVRWVAGF PFPHIERDYE FVALRHPHEY PLNEGRLVSS RGLDIDIADY ETEFEERQVP
HSTALHSHLK RRGAYLVGPL ARYALNFDRL PEHIRALAGE VGLGPLCRNP FQSIVVRALE
ILYACEEALA IIAAYRPPDM ACVPLEPRAA TGFGCTEAPR GTLWHRYELS ADGSVEAARI
VPPTAQNQPS IEADLEAVAT SLLDQPEEVI RRRCELSIRN HDPCISCATH FLKLSVHRA