Gene Hhal_0536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0536 
Symbol 
ID4710791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp604962 
End bp606212 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID639854993 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001002124 
Protein GI121997337 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATGG TCGAGGAGCC GGCCCGTGAG GGGCTCGTGG ACATGGATCG GCTGCGCGCC 
GAGTTTCCGG TGCTCCGCCG GGAGGTCAAC GGACGGCCGC TGGTCTACCT CGACAGCGCG
GCCAGTGCGC AGAAGCCGCA GTCGGTGATC GATGCCGAGC TCGATTGTTA TCAGCACTAC
TACGCCAACG TGCACCGCGG GGTGCACACC CTCTCCCAGG AGGCCACCAC CGCCTTTGAA
GGGGCGCGGA GCGAGGCCCA GCGGTTCCTC AATGCCCCCA GCGAGCGGGA GATTATCTTC
CTGCGCGGGG TGACCGAGGC GATCAACCTG GTGGCCCACA GCTTTGTGGC GCCGCGGGTC
GGTCCCGGTG ACGAGATCCT GGTCACCCAC ATGGAGCACC ACTCCAACAT CGTCCCCTGG
CAGCTGGTCT GTGAGCGCAC CGGGGCGCAG CTGCGGGTGG TGCCCATCGA CGACAACGGG
GATGTGGACC TGGAGACGGT GCGCGGCATG ATCCACGAGC GCACCCGGCT GGTCAGCGTG
GTCCACGTCT CCAACGCCCT GGGGGCGGTC AATCCGGTGG CGGAGATCGC CGCCATGGCC
CGAGCCCAAG GCGTGCCGGT GCTGCTCGAC GGCGCCCAGG CGGCCCCGCA TCTGCCGGTG
GATATTCAGG AGCTGGGAGT CGACTTCTAC GCCTTTTCCG GGCACAAGGC CTACGGCCCG
ACCGGGATCG GCGTCCTCTG GGGCCGCTAT GAGCACCTGG CGGGCATGGT GCCCTACCAG
GGGGGCGGCG ACATGATCCG CCACGTCTCC TTCTCCGGCA CCGAGTACGC CGCGCCGCCG
GCGCGTTTCG AGGCGGGGAC GCCGAATATC GCTGGCGCCA TCGGTCTGGG CGAGGCCCTG
CGCTACATCG ACGCCATCGG CCGTGAGCGG ATCGCCGCCC GCGAAGAGGA CCTGGTCAAC
CACGCCGCCG AGGCCATCGC CGCCGTGCCG GGGGTGCAGC TGATCGGCCG GCCCCAACGC
CGGGCCGGTG CCGTGTCTTT CGTGATGGAA GGGACGCACC CCAACGATCT GGCCATGCTC
CTCGATGAGC AGGGGATCGC GATCCGCGCC GGCCATCACT GCGCCCAGCC GGTGATGGAG
CGCTTCGGTG TGCCCGCCAC GGCGCGCGCC TCGTTCGGGG TCTACAACAC CCACGATGAG
GTGGAGAGCC TGGTGGTCGG CCTGGAGAAG ATCCGCCGCC TGTTCGGCTG A
 
Protein sequence
MSMVEEPARE GLVDMDRLRA EFPVLRREVN GRPLVYLDSA ASAQKPQSVI DAELDCYQHY 
YANVHRGVHT LSQEATTAFE GARSEAQRFL NAPSEREIIF LRGVTEAINL VAHSFVAPRV
GPGDEILVTH MEHHSNIVPW QLVCERTGAQ LRVVPIDDNG DVDLETVRGM IHERTRLVSV
VHVSNALGAV NPVAEIAAMA RAQGVPVLLD GAQAAPHLPV DIQELGVDFY AFSGHKAYGP
TGIGVLWGRY EHLAGMVPYQ GGGDMIRHVS FSGTEYAAPP ARFEAGTPNI AGAIGLGEAL
RYIDAIGRER IAAREEDLVN HAAEAIAAVP GVQLIGRPQR RAGAVSFVME GTHPNDLAML
LDEQGIAIRA GHHCAQPVME RFGVPATARA SFGVYNTHDE VESLVVGLEK IRRLFG