Gene Avin_50580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50580 
SymbolhoxG 
ID7763907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5122546 
End bp5124354 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content67% 
IMG OID643807887 
ProductMembrane bound nickel-dependent hydrogenase, large subunit, HoxG 
Protein accessionYP_002802121 
Protein GI226947048 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.980558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCC TGCCGAACGC CAGCCAACTG GACAAGTCCG GCAGGCGCAT CGTCGTCGAC 
CCGGTGACCC GCATCGAGGG CCACATGCGC TGCGAGGTCA ACGTCGACGC CAGCAACGTG
ATCACCAACG CCGTCTCCAC CGGCACCATG TGGCGCGGCC TGGAGGTCAT CCTCAAGGGC
CGCGACCCGC GCGACGCCTG GGCCTTCGTC GAGCGCATCT GCGGCGTCTG CACCGGCACC
CATGCGCTGA CCTCGGTGCG CGCGGTGGAG GATGCCCTGG ACATCCGCAT CCCCTACAAC
GCCCACCTGA TCCGCAACCT GATGGACAAG ACGCTGCAGG TGCACGACCA CATCGTGCAC
TTCTACCACC TGCACGCGCT GGACTGGGTC AACCCGGTCA ACGCCCTGAA GGCCGATCCC
AAGGCTACCT CCGCCCTGCA GCAGGCGGTT TCGCCGGCCC ATGCCAAGTC CAGCCCCGGC
TACTTCCGCG ACGTGCAGAC GCGCCTGAAG AAGTTCGTCG AGAGCGGCCA GCTCGGCCTG
TTCTCCAACG GCTACTGGGA CAATCCGGCC TACAAGCTGC CGCCCGAGGC GGACCTGATG
GCCGTGGCCC ACTACCTGGA GGCGCTGGAC CTGCAGAAGG ACATCGTCAA GATCCATACC
ATCTTCGGCG GCAAGAACCC GCATCCGAAC TACATGGTCG GCGGCGTGGC CTGCGCCATC
AACCTGGACG ACGTCGGCGC CGCCGGCGCG CCGGTCAACA TGACCAGCCT GAACTTCGTC
CTCGAACGCA TCCACGAGGC CCGCGAGTTC ACCAGGAACG TCTACCTGCC GGACGTGCTG
GCGGTCGCCG GGATCTACAA GGACTGGCTG TACGGCGGCG GTCTGGCCGC GCACAACCTG
CTGTCCTACG GCACCTTCAC CAAGGTGCCC TACGACAAGT CCAGCGACCT GTTGCCGGCC
GGCGCCATCG TCGGCGGCAA TTGGGACGAG GTGCTGCCGG TCGACGTGCG CGATCCCGAG
GAGATCCAGG AGTTCGTCAG CCACTCCTGG TACAGCTACG CCGACGAAAC CAAGGGGCTG
CATCCCTGGG ACGGCGTCAC CGAGCCGAAA TTCGAGCTCG GCCCGAACAC CAAGGGCAGC
CGCACCCACA TCCAGGAAAT CGACGAGGCG CACAAGTACA GCTGGATCAA GGCGCCGCGC
TGGCGCGGCC ACGCTATGGA GGTCGGCCCG CTGGCACGTT ACATCATCGC CTACGCTTCG
GGCCGCGAAT ACGTGAAGGA ACAGGTCGAC CGCTCGCTGG CCGCCTTCAA CCAGAGCACC
GGCCTGAACC TCGGCCTCAA GCAGTTCCTG CCCTCGACCC TCGGCCGCAC CCTGGCGCGC
GCCCTGGAGT GCGAGCTGGC GGTGGACAGC ATGCTCGACG ACTGGCAGGC CCTGGTCGGC
AACATCAAGG CCGGCGACCG CGCCACCGCC AACGTCGAGA AGTGGGACCC GAGCACCTGG
CCGAAGGAGG CCAAGGGCGT GGGCATCAAC GAGGCGCCGC GCGGCGCCCT GGGCCACTGG
ATCAGGATCA AGGACGGCAA GATCGAGAAC TACCAGGCGA TCGTGCCGAC CACCTGGAAC
GGCACCCCGC GCGACCATCT GGGCAACATC GGCGCCTACG AGGCCGCGCT GCTCAACACC
AGGATGGAGC GCCCGGACGA GCCGGTGGAG ATCCTGCGCA CCCTGCACAG CTTCGACCCC
TGCCTGGCCT GTTCGACCCA CGTGATGTCG CCGGACGGCC AGGAGCTGAC CCGGGTGAAG
GTCCGCTGA
 
Protein sequence
MSSLPNASQL DKSGRRIVVD PVTRIEGHMR CEVNVDASNV ITNAVSTGTM WRGLEVILKG 
RDPRDAWAFV ERICGVCTGT HALTSVRAVE DALDIRIPYN AHLIRNLMDK TLQVHDHIVH
FYHLHALDWV NPVNALKADP KATSALQQAV SPAHAKSSPG YFRDVQTRLK KFVESGQLGL
FSNGYWDNPA YKLPPEADLM AVAHYLEALD LQKDIVKIHT IFGGKNPHPN YMVGGVACAI
NLDDVGAAGA PVNMTSLNFV LERIHEAREF TRNVYLPDVL AVAGIYKDWL YGGGLAAHNL
LSYGTFTKVP YDKSSDLLPA GAIVGGNWDE VLPVDVRDPE EIQEFVSHSW YSYADETKGL
HPWDGVTEPK FELGPNTKGS RTHIQEIDEA HKYSWIKAPR WRGHAMEVGP LARYIIAYAS
GREYVKEQVD RSLAAFNQST GLNLGLKQFL PSTLGRTLAR ALECELAVDS MLDDWQALVG
NIKAGDRATA NVEKWDPSTW PKEAKGVGIN EAPRGALGHW IRIKDGKIEN YQAIVPTTWN
GTPRDHLGNI GAYEAALLNT RMERPDEPVE ILRTLHSFDP CLACSTHVMS PDGQELTRVK
VR