Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0895 |
Symbol | |
ID | 5732796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1022779 |
End bp | 1023924 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278027 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_001543671 |
Protein GI | 159897424 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.196503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAC GTTTAGGAAT TCTTGGTGGC GGCCAATTGG CGCGGATGTT GGTGCAAGCG GCGATTCCCT TGGGAATTGA AACCGTGGTT TTGGCCAACC GCAACGATGA GCCAGCGGCA TTGGTCAGCA AAGCGGTGGT TGGCGAGTGG AGTGATCGAG CCTTGCTTGA TCAATTTGCT CAAGCGGTTG ATGTGGTAGT GCTGGAAAAC GAGTTTATTG GCTCGGAAAA ATTGGCCTAC CTTGCCAGCA AAGGCGTGCA ACTTGTCCCC GATCAGGCAA CGCTGGGCTT GATTGAGGAT AAAGCGCAAC AAAAATTAAC ATTGGCAGCC GCAGGCCTGC CTGTACCAGC CTTGGCCTTA ATCGAATCGT TGGACGATGT GGCGGCCTTT GGGGCTGAAC ATGGTTTCCC GTTGATGCTG AAAACGCGGC GCAACGGCTA CGATGGCCGT GGCACGGCCA AAATCCGCAG CGCCGAAGAG ATTGCCAGCG CTTGTAGCTC CTTGGGTTTT CCCGAAAATC CAGTGTTTGT TGAGGCATGG GTTCCCTTCG AAGCCGAGTT GGCAACCTTG ATTATTCGCT CGGTCAGCGG CGAACAATGC GTCTATCCAG TGATTGAAAC CTACCAACCG AGCGGCGTTT GTCGGGTGGT GCGTGCACCA GCACCCTTCT CAGCAGCAAT TCAACAACAG GCCAGCGAGG TTGCTCAAGC GACGGCAGCA GCGCTTGGTG GCTTAGGCAT CTTGGCGGTT GAAATGTTTC TGACCAGCAA GGGCCAAATT TTGGTCAATG AGCTAGCGCC CCGTCCACAC AACACTGGTC ACTATTCAAT TGAGGGTTGT TATTGTTCAC AATTTGAAAA TACGATTCGT GCTGCGCTGG GCTGGCCGTT GGGTGATCCT CAGTTGCGCC ACCAAAGTGC CGTGATGGTC AATGTGCTCG CGCCAACCAC ACGACCACTT GAAAGCAGCT TGATTCAAAC GGCGCTCAAA CCACAGGTGC ATGTGCATTG GTACGACAAA CGCAGCGCCA AGCCAGGCCG TAAAATTGGC CATATCACCG CTGTAGGAGC CGAACCAAGC GAAGTCGAAG CGCGTGCCCA AGCCGCCGTC GATCAGCTAG AGCGGGTTTT AGCTGAACCC GTATAA
|
Protein sequence | MTKRLGILGG GQLARMLVQA AIPLGIETVV LANRNDEPAA LVSKAVVGEW SDRALLDQFA QAVDVVVLEN EFIGSEKLAY LASKGVQLVP DQATLGLIED KAQQKLTLAA AGLPVPALAL IESLDDVAAF GAEHGFPLML KTRRNGYDGR GTAKIRSAEE IASACSSLGF PENPVFVEAW VPFEAELATL IIRSVSGEQC VYPVIETYQP SGVCRVVRAP APFSAAIQQQ ASEVAQATAA ALGGLGILAV EMFLTSKGQI LVNELAPRPH NTGHYSIEGC YCSQFENTIR AALGWPLGDP QLRHQSAVMV NVLAPTTRPL ESSLIQTALK PQVHVHWYDK RSAKPGRKIG HITAVGAEPS EVEARAQAAV DQLERVLAEP V
|
| |