Gene Information |
Locus tag | Avin_12940 |
Symbol | hisC |
ID | 7760236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1257673 |
End bp | 1258728 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643804196 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_002798495 |
Protein GI | 226943422 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
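The coordinate and length fields above are related by simple arithmetic, sketched below in Python. This is an illustration only: the function names `gene_length` and `protein_length` are hypothetical and not part of the source database. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon, which is consistent with the values listed in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(1257673, 1258728)
print(length_bp)                  # 1056
print(protein_length(length_bp))  # 351
```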
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCAAAT TCTGGAGTCC CTTCGTCAAG AACCTGGTGC CCTATGTGCC AGGCGAACAG CCGAAGCTGA GCAAGCTGGT CAAGCTCAAT ACCAACGAAA ATCCCTACGG GCCGTCGCCG CGGGCGATCG CCGCCATGCA GGCCGAGCTG AACGATGGCC TGCGTCTGTA TCCCGATCCC AACGGCGAGC GCCTGAAGCA GGCGATCGCC GACTACTATG GAGTGCGGAG CAGTCAGGTG TTCGTCGGCA ATGGCTCCGA CGAGGTGCTG GCGCACATCT TCCATGGGCT GTTCCAGCAT GGCCGGCCGC TATTGTTCCC GGATGTGACC TACAGTTTCT ATCCGGTGTA CTGTGGTCTC TACGCAATTC CCTTCGAGAC CGTTGCGCTG GACGAGGAGT TTCGCATTCG CGTCGAAGAC TTTGTGCGTC CCAACGGAGG CATCATCTTC CCCAATCCCA ATGCTCCGAC AGGCTGCCTG CTGCCGCTCG AGTCCATCGA GCGGCTGCTC GAGAGCAATC CGGACTCGGT GGTGGTGGTG GATGAGGCCT ATGTGGATTT CGGTGGCGAG ACGGCGATCG GTCTCGTTGA CCGGCATGCC AATCTCCTGG TGACCCAGAC CCTGTCCAAG TCGCGTTCCC TGGCCGGGCT GAGGGTCGGT CTGGCGGTCG GTCACCCGGA ACTGATCGAG GCGCTGGAGC GGATCAAGAA CAGCTTCAAT TCCTATCCGC TGGATCGTAT GGCCATAGCT GGAGCGGCGG CGGCCTTCGA GGATCGTGCC TACTTCGAGC AGACCTGCCG GCAGGTGATC GACAGCCGCG AACGCCTGGT CGGTGAGTTG CAGCGTCTGG GCTTCGAAGT ATTGCCATCG GCAGCCAACT TCGTCTTCGC TCGCCACCCT GTTCACGATG CCGAACGATT GGCGGCAGGG CTGCGCGAGC AGGGGGTGAT AGTTCGCCAT TTCAAGCAGG AACGGATTCG TCAGTTCTTG CGCATCACCG TCGGAGCCCC GGAACAGAAC CGGGCGCTGA CCGATGTGCT GGCGGCCCTT TGCTGA
|
Protein sequence | MSKFWSPFVK NLVPYVPGEQ PKLSKLVKLN TNENPYGPSP RAIAAMQAEL NDGLRLYPDP NGERLKQAIA DYYGVRSSQV FVGNGSDEVL AHIFHGLFQH GRPLLFPDVT YSFYPVYCGL YAIPFETVAL DEEFRIRVED FVRPNGGIIF PNPNAPTGCL LPLESIERLL ESNPDSVVVV DEAYVDFGGE TAIGLVDRHA NLLVTQTLSK SRSLAGLRVG LAVGHPELIE ALERIKNSFN SYPLDRMAIA GAAAAFEDRA YFEQTCRQVI DSRERLVGEL QRLGFEVLPS AANFVFARHP VHDAERLAAG LREQGVIVRH FKQERIRQFL RITVGAPEQN RALTDVLAAL C
|
| |
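The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any computation. The Python sketch below shows one way to do that and to run a few sanity checks (GC percentage, start codon, stop codon). All function and constant names are hypothetical. The start-codon set is an assumption that lists only the three most common initiation codons of translation table 11 (ATG, GTG, TTG), not the full set. Under table 11 an initiating GTG is translated as methionine, which is why the protein above begins with M although the gene begins with GTG.

```python
# Common bacterial initiation codons (assumption: a subset of translation table 11).
COMMON_TABLE11_STARTS = {"ATG", "GTG", "TTG"}
# Stop codons of translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the sequence into blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, a common start codon, and a terminal stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_TABLE11_STARTS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same block layout; paste the full gene sequence here instead.
toy = clean_sequence("GTGAGC AAATGA")
print(toy)                           # GTGAGCAAATGA
print(looks_like_complete_cds(toy))  # True
print(round(gc_percent(toy), 1))
```

Applied to the full gene sequence, `gc_percent` can be compared against the GC content field of the record, and `len(clean_sequence(...))` against the Gene Length field.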