Gene Avin_50500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50500 
SymbolhoxV 
ID7763899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5117781 
End bp5118827 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content76% 
IMG OID643807879 
Producthydrogenase expression/formation protein HoxV 
Protein accessionYP_002802113 
Protein GI226947040 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.102569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCCC TGGAGGCACT GGCCGGACGC CTGCACGTGG AGGTGCGCCT GCAGGACGGT 
GTCATCCGCG CCGTCGATAC CCGACTGCAG CGCCCGCTGC CGCAGATTTC CCGCCTGCTC
GTCGGGCAGA CCGCGGAGGC GGCCCTGAGG CGCCTGCCGC TGCTGTTCGG CCTGTGCGCC
GCGGCGCAGC AGGTGGCGGC ACTACGGGCG CTGGAGCGGG CCGCCGGCTG GGCGGCGATT
GCCGAGGTGG AGGAGGGCCG CACCCGGCTC GGCGAACTGG AGTCGATCCG TGAGTCCCTG
CTGCGCCTGG TGCAGGTCTG GGAGCTGCCT GTGCCCCTGG AGCGGCTCAA GGCGCTGCTC
GCCCTGTGCC GGCGCGCCGC CGCCCGCCTG CAAGCGCTGA CCGCCTTTCG CGCCGCGCCG
TTGCCGGCCG ATGCGGAGCT GGAGGGGACG CTGGCCGCGC TGGCCGCCGC CTGGGCCGAC
CTGCAACCGC CGGCACCGGC CGACTGGCTG CGTCCGCGTC TCGACCGCTG GCAGGAAGTC
GCGCTCGGTG GACCGCCACC GCAGGCCTTC ACAGCGGACG AATTGCCGGC GCTGCTGGCG
CAATTGCGCG CCAGCGACGC GCGCGCCGAG ATCGCCGGGC AGCCGCGGCT CGGCGGGCCG
GCCGCCAGCG CCGGGGCGCA GGCGACGGCG AGCGCGCAGA TCGAGCAGCA CGTCGGCGCG
CTGCTGCGGC GCACGGCGCA GGCGATAGAC TCGCTGCAGT CGCCGCCAGC GCCGCCGGCC
GTGGCCGGGC TGGCGGCGGG AGAGGGTGTC GGCCTGGCGC GGACCGCCCG CGGCTGGCTG
CTGCACCGGG TGTGCCTGGA CGACGGGGCG GTCGGCACCT GGCAACTGCT GGCGCCGACC
GACTGGAATT TCCATGCCGA CGGCCCGCTG CGCCGCCGGC TGTGCGGCGT GCGGGTGGCC
GCCGGGGAGG TCGAGGCGCT GCTGCGCGAA CTGATCCTCG CGCTCGATCC CTGCGTCGCT
TTCGAGGTGA AGATCGTCCA TGCATGA
 
Protein sequence
MSALEALAGR LHVEVRLQDG VIRAVDTRLQ RPLPQISRLL VGQTAEAALR RLPLLFGLCA 
AAQQVAALRA LERAAGWAAI AEVEEGRTRL GELESIRESL LRLVQVWELP VPLERLKALL
ALCRRAAARL QALTAFRAAP LPADAELEGT LAALAAAWAD LQPPAPADWL RPRLDRWQEV
ALGGPPPQAF TADELPALLA QLRASDARAE IAGQPRLGGP AASAGAQATA SAQIEQHVGA
LLRRTAQAID SLQSPPAPPA VAGLAAGEGV GLARTARGWL LHRVCLDDGA VGTWQLLAPT
DWNFHADGPL RRRLCGVRVA AGEVEALLRE LILALDPCVA FEVKIVHA