Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0089 |
Symbol | |
ID | 4710526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 103596 |
End bp | 104891 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639854547 |
Product | hypothetical protein |
Protein accession | YP_001001686 |
Protein GI | 121996899 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.678836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAGT TCGGCGTGAT CAATCCTCAG CCTTATATGG AGCCGGCGCT CGCCGAGCTG GAGCAGCACT TTGGCGTTGA GCGCATCTAT CCCAAGGGCT GGGGCGCCCG GGAGATCGAG GCCACGGCTC GGCACTGTCA CGAGCAGGGG GTGGTTGCCG TTGCCGGTTT CGCCCAGAAG GACGCCTTCC ATCACCTGCT GATCAACGAG CGCCTGGGCA ATCCGGTGCC CTCGCGGGTC GCCTTCTTCT ACTGCATGAA CAAGTATCTG ATGCGCACCC TGGAGCGGGA TCCCTTCTTC TATGCCCCGG TCGACCCGCT CCAGGAAAGC GATGACCAGA TCGCCGCGCG GGTGCCCGCG CACGAGTGGC CCTTCATGCT CAAGAACACC TCCCTGTCGC TCGGCCGGGG GATCTTCCGC ATCGCTAGCG TCGACGAGTT GCAGCGGGTG CTCGCCGACT ACCGGCAGGA TCATGAACTG CAGCGGGCGC TGGCACGCCA ATATGCAGCC TATCTTGATG GTGTTCCGCC GCAGCAGGTG CCGGCCCTGG CGCCGCCGTT CATCGCCGAG CACCTGGTCG ATATCAACCG CGCCACCGAG TACTGTTACG AGGGATATAT CACCAATGAT GGCGAGGTGG TTCACTACGG TCTGACCGAA GAGGTCTACT TCTCCAATCA TCAGGCGCTG GGGTATCTGA CCCCGCCGGT CTCCATCAGC CGGGACATGG CCGATACGAT TGAAGCGTGG GTCTCGGCGT ACATGCGTCG GCTGGCGGAC CTCGGTTACC GCAACCAGTT CTTCAACCTG GAGTTCTGGG TGATGCCCGA CGGCGCACTG CACCTGACCG AGATCAATCC GCGGGCCGCG CACACCTATC ACTACAACTA CCGTTACTCC TTCGGCAACT CGCTCTACGC AGACAATCTC CTGCTGGCCG CCGGCGAGCA GCCGGCGAGG CCCACACCCT GGGATCGCTG GCGGGCCGGC GGATCCTACC GGTATACGTT GATCGTGCTG ATCACTGCGC GGGAGTCAGG ACGCGTTGAT GAGATCCTCG ATTACGACTA TGTCGACGCC CTGGAGGCCG AGCAAGGGGT CCTGGTCCGG CATGTGCGTC GGCGCGATGA GGTCATCGAT GAGTCCGAGT TGTCGGCCGC GGGCGTGATG CTCCAGCAGC TCTGGATTAC CGCCCCTAGC TCCGTGGAGA TCATCGCCCG GGAGCGGGAG ATCCGCTCGC GCATCTACCG CAACCGGCAA GATGCCGTGG CCTATCCCCC CTTCTGGCGG ATTTAG
|
Protein sequence | MMKFGVINPQ PYMEPALAEL EQHFGVERIY PKGWGAREIE ATARHCHEQG VVAVAGFAQK DAFHHLLINE RLGNPVPSRV AFFYCMNKYL MRTLERDPFF YAPVDPLQES DDQIAARVPA HEWPFMLKNT SLSLGRGIFR IASVDELQRV LADYRQDHEL QRALARQYAA YLDGVPPQQV PALAPPFIAE HLVDINRATE YCYEGYITND GEVVHYGLTE EVYFSNHQAL GYLTPPVSIS RDMADTIEAW VSAYMRRLAD LGYRNQFFNL EFWVMPDGAL HLTEINPRAA HTYHYNYRYS FGNSLYADNL LLAAGEQPAR PTPWDRWRAG GSYRYTLIVL ITARESGRVD EILDYDYVDA LEAEQGVLVR HVRRRDEVID ESELSAAGVM LQQLWITAPS SVEIIARERE IRSRIYRNRQ DAVAYPPFWR I
|
| |