Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1622 |
Symbol | |
ID | 6975038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 1802656 |
End bp | 1804011 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643391158 |
Product | phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
Protein accession | YP_002276015 |
Protein GI | 209543786 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase [COG1247] Sortase and related acyltransferases |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGA TCGGCACGCC GCCCCAGACC CGCCCACGCG CCGAACGGGT GCAGGCGCTG CATGAAGACG ATCTGCAGGC GCTGTGCGAA GCCACCGACG CCGCGATCCT GGATGGCGGG GGGTTCGGCT GGCTGACCAC GCCCGGCCGG CAGGCGATGG AGCGCTATTT TCGCGGCGTG CTGATGGTGC CGGAGCGCAT GCTGTTCGTG GTCCGGCTGG ACGGCATCAT CGTGGGTGCC GCCCAACTGG TGCGCCCGCC GCGCAATAAC GAGGCGCAGG CCATGAGCGC CACGCTGATG CACCTGTATG TCGCGCCCTA TGCCAGGGGC CTGGGGCTGG GCCGGCTGCT GCTGCTGGAG GCCGAGCAAT GCGCGCGGGC CATGGGGTAC CAGATCCTGA ACCTGGACGT GCGCGAGACG CAGGAATCGG CCATCCGGCT GTTCCGCGCC TTCGGCTTCC ATCACTGGGG CACCCATCCC AGCTATGCCC GCACGGAGGG GCGCACGGTG CGGGGCCTGT TCTTCACCAA GCGCCTGCAG GACAATGAAC GGGTGGTACC CGCGCATCCG CAGGCCGCTT CCATTCCCAT TCCTGCCACG GGACCCGCCA GCGTGACCGG ACACAGCCTG ACCCTGTACC CCGCCATCGA CCTGAAGGAC GGGGCATGCG TGCGCCTGCG CCGCGGGGAA ATGGACGATG CCACGATCTA TTCCGACAAT CCGGGCGCGC AGGCCCGTGC CTGGGTGCAG GCGGGTTGCC GGTGGCTGCA TGTCGTGGAC CTGAACGGTG CGTTCGCCGG CCGGTCGGCC AATGGCGACG CGGTCGAGGC GATCATCGCC AACGCCACCG TGCCGGTGCA GCTGGGCGGC GGCCTGCGGG ACATGGCGGG CATCGAACGC TGGCTGGCCG CCGGTGTCAC GCGCGTCATC CTGGGCAGCG TCGCGGTCAA GGACCCCGAG CTGGTGCGCG CGGCCTGCCG CGCCTTCCCC GGCCGGATCG TGGCGGGGAT CGACGCGCGA TCGGGCCAGG TGGCGACCGA GGGCTGGGCC GAGACGTCGG ACATGAAGGC GGTCGAACTG GCCCGCCGGA TGGAGGGCGT GGGGGTGGCC GCGATCATCT TCACCGAGAT CAGCCGCGAC GGCATGCTGA CGGGCATCGA CATCGCCCAG ACCGTCGAGA TGGCGAATGC GCTGTCGATC CCGGTCATCG CCAGCGGCGG GGTCGGCCAT GCCGACCACC TGCATGCCCT GCGTGCCGCG ACGGTGCAGG CACCGGGGAT CGAGGGCGTG ATCGTCGGCC GGGCGCTGTA TGACGGCCGG GTGGACCCGG CCGAGGCCTT GCGGATCCTG TCCTGA
|
Protein sequence | MNEIGTPPQT RPRAERVQAL HEDDLQALCE ATDAAILDGG GFGWLTTPGR QAMERYFRGV LMVPERMLFV VRLDGIIVGA AQLVRPPRNN EAQAMSATLM HLYVAPYARG LGLGRLLLLE AEQCARAMGY QILNLDVRET QESAIRLFRA FGFHHWGTHP SYARTEGRTV RGLFFTKRLQ DNERVVPAHP QAASIPIPAT GPASVTGHSL TLYPAIDLKD GACVRLRRGE MDDATIYSDN PGAQARAWVQ AGCRWLHVVD LNGAFAGRSA NGDAVEAIIA NATVPVQLGG GLRDMAGIER WLAAGVTRVI LGSVAVKDPE LVRAACRAFP GRIVAGIDAR SGQVATEGWA ETSDMKAVEL ARRMEGVGVA AIIFTEISRD GMLTGIDIAQ TVEMANALSI PVIASGGVGH ADHLHALRAA TVQAPGIEGV IVGRALYDGR VDPAEALRIL S
|
| |