Gene Hhal_0056 details

Gene Information

Locus tag: Hhal_0056
Symbol:
ID: 4710505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 62060
End bp: 63517
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 70%
IMG OID: 639854514
Product: Fmu (Sun) domain-containing protein
Protein accession: YP_001001653
Protein GI: 121996866
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
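
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # both endpoints are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(62060, 63517))  # (1458, 485)
```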


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCCGAC GAGAGCGATA CCAGCCGCCC GAGGATCTCG CTCGCGGGGA GAGCGCCTAC 
ACGCGTTATC GAGCGGTGAT CGACGATTGG CAGGCGTTCT GCGACGCGCT GCAACGCCCC
CTGAGCCCGT GTCTGCGCGC CAACACGACT CGCCTCACCC GGGATGAGTT GGACAAGCTG
CTGCGTGATG AGGGTTGGGA TCCCCGCCCG CTGGGCTGGC AGGGGGATGC CTTGCTCGTG
GACGGAGGGT TTCGCCCCGG GCACCACTGG GGCGCCATCG CCGGCCTCTA CCAGATCCAG
GAGGCGGCCT CGCTCCTTCC TGTGCAGCTG CTCGATCCCC GGCCGGGTGA GCGAGTCCTC
GATCTGTGCG CGGCGCCGGG GAACAAGACC GTCCAGGTGG CGGATGCCCT GGGTAATCGG
GGCACCGTGG TCGCCAACGA TGCAAGCGCG GGTCGGCTGG GCGCGCTGGG GCAGGCGGTG
AAGCGCCACG GGGTGGTCAA TGTCTCGCAG ACCGTCCGCG ATGGCCAGGG CATGCCTTGG
GCCGCCGGGC GTTTCGATAA GGTCGTGGTC GACGCACCGT GCAGCTGCGA GGGTACCTTC
CGTAAGACGG CCACGGCGGC CGAGCCGACG TCCCCTGCGT TCCGCCAGCG GCTTGTCCAG
CGGCAGCAGC GCCTGTTGCT GCGCGGCATG GCGCTGACCC GCCCCGGGGG CACCGTGGTC
TACTCGACAT GCACCTTCGC TCCGGAGGAG AACGAGGCCG TGGTCGCCGC GGCGCTGGCC
CGTTGTTCCG GGGCGTTCGA GTTGATCCCT GCGCGGGTCG CCGGCCTGCA GCTGAGCCCG
GGACTGGAGG CGTGGGACGG AGTCGACTTC GGCGCCGACA TGGCCGCCTG TGGCCGCCTC
TGGCCCCACC ACAACGACAC CGGCGGGTTT TTCGTGGCCC TACTGCGGCG TGTCGACGAC
GGGAGCAGCC AGTCGGAGGA CCCGCTACCC CTGCCGGAAG AGCCGCGCGC CCGCACCTTG
CTGCAGACCT TTGAGGATGA GCTCGGGGTG TCGGCTGAGG TGCTCGATGG ACTGACCGCG
TTCTTTGAGG GCAGCAAGTA CGCCAAGGTC GTGGCGGCGG ATCACACCGC AGCGGGCGGA
ATCCCGGTCG TGCGCAGCGG CATCCCTGCC GTGCGGGCCC AGACCCGGCC TCCGAAGCCG
TCCACGGCTG GGGTCATGGC GCTGGGACAC CACGCCCGGG GAGCGGTGCT GGAGTTGGAG
CGGGCCGAGG TCTACGCGTT TTTCCGTCGT GAGCCGCTGC TGCTGGGGCC GGAGCGTGGC
AGCGGGCTGC GCGAGGGGGG GCACGTAGTG CTGCGCCACC GCGGCCATAC CATCGGTATC
GGTATGTACC GCAACGGCGC GATGGTCAGC CTGTTCCCCA AGGCGTGGTC GCGGGCTGCC
GGAGGCACCG TGGGATAG
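
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is supplied as a plain string in which the 10-base groups are separated by whitespace. The helper name `gc_content` is hypothetical, and the example applies it to the first row only, not to the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base groups, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied from the record above.
first_row = "GTGGCCCGAC GAGAGCGATA CCAGCCGCCC GAGGATCTCG CTCGCGGGGA GAGCGCCTAC"
print(round(gc_content(first_row), 1))
```

Pasting the full gene sequence into `gc_content` gives the value to compare against the GC content field.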
 
Protein sequence
MARRERYQPP EDLARGESAY TRYRAVIDDW QAFCDALQRP LSPCLRANTT RLTRDELDKL 
LRDEGWDPRP LGWQGDALLV DGGFRPGHHW GAIAGLYQIQ EAASLLPVQL LDPRPGERVL
DLCAAPGNKT VQVADALGNR GTVVANDASA GRLGALGQAV KRHGVVNVSQ TVRDGQGMPW
AAGRFDKVVV DAPCSCEGTF RKTATAAEPT SPAFRQRLVQ RQQRLLLRGM ALTRPGGTVV
YSTCTFAPEE NEAVVAAALA RCSGAFELIP ARVAGLQLSP GLEAWDGVDF GADMAACGRL
WPHHNDTGGF FVALLRRVDD GSSQSEDPLP LPEEPRARTL LQTFEDELGV SAEVLDGLTA
FFEGSKYAKV VAADHTAAGG IPVVRSGIPA VRAQTRPPKP STAGVMALGH HARGAVLELE
RAEVYAFFRR EPLLLGPERG SGLREGGHVV LRHRGHTIGI GMYRNGAMVS LFPKAWSRAA
GGTVG
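
The protein sequence follows from the gene sequence under translation table 11 (bacterial, archaeal and plant plastid code). The sketch below shows this for the first row of the gene. It assumes the table 11 codon assignments match the standard code, and that an alternative initiation codon such as GTG is read as methionine at position 1. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical helpers, not a library API.

```python
from itertools import product

# Codon table in the usual TCAG order; table 11 uses the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "GTGGCCCGAC GAGAGCGATA CCAGCCGCCC GAGGATCTCG CTCGCGGGGA GAGCGCCTAC"
print(translate_cds(first_row))  # MARRERYQPPEDLARGESAY
```

The output matches the first 20 residues of the protein sequence. The gene starts with GTG rather than ATG, which is why the start-codon rule is needed to obtain the leading M.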