Gene Hhal_0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0089 
Symbol 
ID4710526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp103596 
End bp104891 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content63% 
IMG OID639854547 
Producthypothetical protein 
Protein accessionYP_001001686 
Protein GI121996899 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.678836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAGT TCGGCGTGAT CAATCCTCAG CCTTATATGG AGCCGGCGCT CGCCGAGCTG 
GAGCAGCACT TTGGCGTTGA GCGCATCTAT CCCAAGGGCT GGGGCGCCCG GGAGATCGAG
GCCACGGCTC GGCACTGTCA CGAGCAGGGG GTGGTTGCCG TTGCCGGTTT CGCCCAGAAG
GACGCCTTCC ATCACCTGCT GATCAACGAG CGCCTGGGCA ATCCGGTGCC CTCGCGGGTC
GCCTTCTTCT ACTGCATGAA CAAGTATCTG ATGCGCACCC TGGAGCGGGA TCCCTTCTTC
TATGCCCCGG TCGACCCGCT CCAGGAAAGC GATGACCAGA TCGCCGCGCG GGTGCCCGCG
CACGAGTGGC CCTTCATGCT CAAGAACACC TCCCTGTCGC TCGGCCGGGG GATCTTCCGC
ATCGCTAGCG TCGACGAGTT GCAGCGGGTG CTCGCCGACT ACCGGCAGGA TCATGAACTG
CAGCGGGCGC TGGCACGCCA ATATGCAGCC TATCTTGATG GTGTTCCGCC GCAGCAGGTG
CCGGCCCTGG CGCCGCCGTT CATCGCCGAG CACCTGGTCG ATATCAACCG CGCCACCGAG
TACTGTTACG AGGGATATAT CACCAATGAT GGCGAGGTGG TTCACTACGG TCTGACCGAA
GAGGTCTACT TCTCCAATCA TCAGGCGCTG GGGTATCTGA CCCCGCCGGT CTCCATCAGC
CGGGACATGG CCGATACGAT TGAAGCGTGG GTCTCGGCGT ACATGCGTCG GCTGGCGGAC
CTCGGTTACC GCAACCAGTT CTTCAACCTG GAGTTCTGGG TGATGCCCGA CGGCGCACTG
CACCTGACCG AGATCAATCC GCGGGCCGCG CACACCTATC ACTACAACTA CCGTTACTCC
TTCGGCAACT CGCTCTACGC AGACAATCTC CTGCTGGCCG CCGGCGAGCA GCCGGCGAGG
CCCACACCCT GGGATCGCTG GCGGGCCGGC GGATCCTACC GGTATACGTT GATCGTGCTG
ATCACTGCGC GGGAGTCAGG ACGCGTTGAT GAGATCCTCG ATTACGACTA TGTCGACGCC
CTGGAGGCCG AGCAAGGGGT CCTGGTCCGG CATGTGCGTC GGCGCGATGA GGTCATCGAT
GAGTCCGAGT TGTCGGCCGC GGGCGTGATG CTCCAGCAGC TCTGGATTAC CGCCCCTAGC
TCCGTGGAGA TCATCGCCCG GGAGCGGGAG ATCCGCTCGC GCATCTACCG CAACCGGCAA
GATGCCGTGG CCTATCCCCC CTTCTGGCGG ATTTAG
 
Protein sequence
MMKFGVINPQ PYMEPALAEL EQHFGVERIY PKGWGAREIE ATARHCHEQG VVAVAGFAQK 
DAFHHLLINE RLGNPVPSRV AFFYCMNKYL MRTLERDPFF YAPVDPLQES DDQIAARVPA
HEWPFMLKNT SLSLGRGIFR IASVDELQRV LADYRQDHEL QRALARQYAA YLDGVPPQQV
PALAPPFIAE HLVDINRATE YCYEGYITND GEVVHYGLTE EVYFSNHQAL GYLTPPVSIS
RDMADTIEAW VSAYMRRLAD LGYRNQFFNL EFWVMPDGAL HLTEINPRAA HTYHYNYRYS
FGNSLYADNL LLAAGEQPAR PTPWDRWRAG GSYRYTLIVL ITARESGRVD EILDYDYVDA
LEAEQGVLVR HVRRRDEVID ESELSAAGVM LQQLWITAPS SVEIIARERE IRSRIYRNRQ
DAVAYPPFWR I