Gene Information |
Locus tag | Hhal_2070 |
Symbol | |
ID | 4710106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2275527 |
End bp | 2276681 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639856543 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_001003636 |
Protein GI | 121998849 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
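The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names `span_length` and `protein_length_from_cds` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2275527
END_BP = 2276681
GENE_LENGTH_BP = 1155
PROTEIN_LENGTH_AA = 384


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count of a CDS (assumes the length includes one stop codon)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2276681 - 2275527 + 1 gives 1155 bp, and 1155 / 3 - 1 gives 384 aa, matching the Gene Length and Protein Length rows.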
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.669778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGGCC CGGCGCCGAT CCTCCCCGGT GCAGCCATCG GGATCCTCGG CGCCGGTCAG CTCGGGCGCA TGCTCGCCAT GGCGGCGCGC CGCTCGGGCT ACCGGGTGCA CGTCATCGCC CCCGGCGCCG GGCAGGCGCC GGCCGGGCAG GTGGCGGATC GGGTGCACGA TGCCGAGCCC ACGGCGGAGC TGCTGTCCAG CCTGGCCGAC GAGGTGAGCG TGCTCACCTA CGAGTTCGAG AACCTGCCGC GTGCGGCCGT CGAGGCCGCC GCCGAGCGCC TGCCGGTGCG GCCATCGCCC CGTGCTCTGG CCACCACCCA GCACCGTATT CTGGAGAAGA CCTTCCTGCG CGAACACGGC CTGCCCGTGG TGCCCTTCGA GGCCGTTCAC GGCCCCGAGG AGGCGGCTGC TGCCGTGGCG CGTATCGGTG CCCCAGCGGT GATCAAGAGC GCCGGGCTGG GCTACGACGG CAAGGGCCAG GCCCGGGTGG AGAGCGCCGA TGAGGTATCG GCCGCGTGGT CGGCTGTCGG CGCCGACGAG GCGGTGGTCG AGGCCTGCGT CGATCTCGCC ATGGAGGTCT CCGTGGTCGC CGCCCGCGGC GTCGATGGCA GCTTCGCCCA CTACGGGGTG ACCGAGAACC GCCACCGGCA CCACATCCTC GATCTATCGA TCGGCGACGC CGAGCTCGAC CCGGCGGTGT GCCGGCAGGC CGTGGAGATC GCCCGAGCGG TGGGCGAGGG TCTCGACGCT GTCGGCACCT ACTGCGTGGA GTTCTTCATC GACGGCGCCG GGCGGCTGAT GGTCAACGAG ATCGCCCCGC GCCCGCACAA CTCCGGGCAT CTGACCATCG AGGGGGCGGC AACCTCGCAG TTCGACCAGC AGCTGCGTGC CATCTGCGGA CTGCCCCTGG GCAGTACCCG GCGGTTGGCC CCGGCGGCCA TGGTGAACCT GCTCGGCGAC GTCTGGGATG CCGGCACGCC GCCGTGGGCC GAGGTCTACC AGGAGCCGAC GGCCACCCTG CACCTCTACG GCAAGGGGGC GCCGAGCCCC GGCCGGAAGA TGGGCCATAT CACCGTCCTC GGCGAGGACC GGCAGGAGGC CGCCGAGCGC GCCCTGAACC TGCGCAACCG ACTGGCACCC CATGTGGTTT CGTAA
|
Protein sequence | MSGPAPILPG AAIGILGAGQ LGRMLAMAAR RSGYRVHVIA PGAGQAPAGQ VADRVHDAEP TAELLSSLAD EVSVLTYEFE NLPRAAVEAA AERLPVRPSP RALATTQHRI LEKTFLREHG LPVVPFEAVH GPEEAAAAVA RIGAPAVIKS AGLGYDGKGQ ARVESADEVS AAWSAVGADE AVVEACVDLA MEVSVVAARG VDGSFAHYGV TENRHRHHIL DLSIGDAELD PAVCRQAVEI ARAVGEGLDA VGTYCVEFFI DGAGRLMVNE IAPRPHNSGH LTIEGAATSQ FDQQLRAICG LPLGSTRRLA PAAMVNLLGD VWDAGTPPWA EVYQEPTATL HLYGKGAPSP GRKMGHITVL GEDRQEAAER ALNLRNRLAP HVVS
|
| |
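The sequence fields above are printed in space-separated blocks of ten, so they need light cleaning before any computation. The sketch below is an illustration rather than the database's own method. It shows one way to strip the spacing, compute a GC fraction comparable to the GC content row, and check that a gene sequence has plausible CDS boundaries. The helper names (`clean`, `gc_fraction`, `has_cds_boundaries`) are hypothetical, and the start codon set is my understanding of NCBI translation table 11, so it should be checked against the NCBI genetic code listing before being relied on.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# Assumption: initiation codons permitted by NCBI translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
# Stop codons of the standard and table 11 codes.
STOPS = {"TAA", "TAG", "TGA"}


def has_cds_boundaries(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts with a
    table 11 start codon and ends with a stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in TABLE11_STARTS and s[-3:] in STOPS


if __name__ == "__main__":
    # Toy input only; paste the full Gene sequence field to check the record.
    toy = "GTGAGC TAA"
    print(gc_fraction(toy), has_cds_boundaries(toy))
```

The gene sequence in this record begins with GTG and ends with TAA, while the protein sequence begins with M. This is consistent with table 11, under which an alternative initiation codon such as GTG is read as methionine at the start of translation.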