Gene Information |
Locus tag | Avin_04620 |
Symbol | hisA |
ID | 7759420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 436716 |
End bp | 437453 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643803384 |
Product | 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imidazole-4-carboxamide isomerase |
Protein accession | YP_002797692 |
Protein GI | 226942619 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATCA TTCCCGCAAT CGACCTCAAG GACGGCGCCT GCGTGCGCCT GCGCCAGGGC CGCATGGACG ATTCCACGGT GTTCTCCGAC GACCCGGTGG CCATGGCCGC CAGATGGGTC GAAGCCGGCT GCCGCCGCCT GCACCTGGTC GACCTCAACG GCGCCTTCGA GGGCCAGCCG ATCAACGGCG AAGTGGTCAC CGCCATCGCC CGGCGCTACC CGGACCTGCC GATCCAGATC GGCGGCGGCA TACGTTCGCT GGAGACCATC GAGCACTACG TGAAGGCGGG CGTCGGCTAC GTGATCATCG GCACCAAGGC GGTCAAGCAG CCGGAGTTCG TCGGCGAAGC CTGCCGAGCC TTTCCCGGCA AGGTGATCGT CGGCCTGGAT GCCAGGGACG GCTTCGTCGC CACCGACGGC TGGGCCGAGG TCAGCAGCGT GCAGGTCGTC GACCTGGCCA GGCGCTTCGA GGCCGACGGC GTGTCCGCCA TCGTCTATAC GGATATCGCC AAGGACGGCA TGATGCAGGG CTGCAATGTC GAGGCCACCG CCGCCCTGGC CGCCGCCACG CGCATCCCGG TGATCGCCTC CGGCGGCATC CACGACCTCG GCGACATCCG CAAGCTGCTG GATGCTCGCG CTCCGGGGAT CGTCGGCGCC ATCACCGGCC GCGCCATCTA CGAGGGTACC CTGGACGTGG CCGAAGCCCA GGCGCTGTGC GACGGCTTCA AGGGCTGA
|
Protein sequence | MLIIPAIDLK DGACVRLRQG RMDDSTVFSD DPVAMAARWV EAGCRRLHLV DLNGAFEGQP INGEVVTAIA RRYPDLPIQI GGGIRSLETI EHYVKAGVGY VIIGTKAVKQ PEFVGEACRA FPGKVIVGLD ARDGFVATDG WAEVSSVQVV DLARRFEADG VSAIVYTDIA KDGMMQGCNV EATAALAAAT RIPVIASGGI HDLGDIRKLL DARAPGIVGA ITGRAIYEGT LDVAEAQALC DGFKG
|
| |
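The coordinate and length fields in the Gene Information table are related by simple arithmetic, and a short check can confirm they agree. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length (the gene sequence above ends in TGA, and the record lists the gene as unspliced). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Avin_04620 (hisA).
length = gene_length_bp(436716, 437453)
print(length)                     # 738, matching "Gene Length | 738 bp"
print(protein_length_aa(length))  # 245, matching "Protein Length | 245 aa"
```

Both results match the values reported in the table, so the record's coordinates, gene length, and protein length are mutually consistent under these assumptions.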