Gene Information |
Locus tag | Avin_50440 |
Symbol | hypE |
ID | 7763893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5111794 |
End bp | 5112819 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643807873 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_002802107 |
Protein GI | 226947034 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.081583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGGC TCGATCTCAG GAACGGCAGC GTGGAGATGG TCCACGGCAG CGGCGGCCGC GCCATGGGCC AACTGATCGA GGAACTGTTC GCCCGCGCTC TGCGCAACGA GTGGCTGGAC CAGCGCAACG ATCAGGCGCA GTTCGAGCTG CCGCCCGGCC GGGTAGTGAT GGCCACCGAC AGCCACGTGA TCTCGCCGCT GTTCTTCCCC GGCGGCGACA TCGGCAGCCT CGCCGTGCAC GGCACCATCA ACGACGTGGC CATGGCCGGC GCCCGACCCT GCTACCTGGC CGCCGGCTTC ATCCTCGAGG AGGGTTTTCC GCTCGCCGAC CTGGCGCGCA TCGTCGAGTC GATGGCCGCC GCCGCCCGCG AGGCCGGCGT GCCGGTGGTC ACCGGCGACA CCAAGGTGGT CGAACACGGC AAGGGCGACG GGGTGTTCAT CACCACCACC GGGGTCGGCG TGGTGCCGCC GGGCTTGCAC CTCTCCGGCG ACCAGGCGCG GCCGGGCGAC CGCATCCTGC TTTCCGGCAG CATCGGCGAC CACGGCGTGA CCATCCTCTC GTTGCGCGAG GGCCTCGGCT TCGAGGCCGA CATCGGCTCC GACTCCCAGG CCCTGCACGG CCTGGTCGCC GCCATGCTCG CGGCGGTGCC GGAGATCCGC TGCCTGCGCG ACCCGACCCG CGGCGGCCTG GGCAACACCC TCAACGAACT GGCCCGCCAG TCGGGGGTCG GCATGCAACT CGTCGAGCGG GCCATCCCGG TGCGCGAACC GGTGCGTGCC GCCTGCGAAT TCCTCGGCCT CGACCCGCTG TACGTGGCCA ACGAAGGCAA GCTGATCGCC ATCTGCCCGG CCGAGCGGGC CGAGCGCCTG CTCGAAGCGA TGCGTGCGCA TCCGCAGGGG CGCGAGGCGG CGATCATCGG CACGGTGGTG GCCGACGAGC ACCGTTTCGT GCAGATGGAG ACACCGTTCG GTGGGAGTCG GATGGTGGAC TGGTTGAGCG GGGAGCAGTT GCCGAGGATT TGCTGA
|
Protein sequence | MSRLDLRNGS VEMVHGSGGR AMGQLIEELF ARALRNEWLD QRNDQAQFEL PPGRVVMATD SHVISPLFFP GGDIGSLAVH GTINDVAMAG ARPCYLAAGF ILEEGFPLAD LARIVESMAA AAREAGVPVV TGDTKVVEHG KGDGVFITTT GVGVVPPGLH LSGDQARPGD RILLSGSIGD HGVTILSLRE GLGFEADIGS DSQALHGLVA AMLAAVPEIR CLRDPTRGGL GNTLNELARQ SGVGMQLVER AIPVREPVRA ACEFLGLDPL YVANEGKLIA ICPAERAERL LEAMRAHPQG REAAIIGTVV ADEHRFVQME TPFGGSRMVD WLSGEQLPRI C
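
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA); the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(5111794, 5112819)
print(length, protein_length_aa(length))  # 1026 341
```

Both results match the Gene Length (1026 bp) and Protein Length (341 aa) fields listed above.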