Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0536 |
Symbol | |
ID | 4710791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 604962 |
End bp | 606212 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639854993 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001002124 |
Protein GI | 121997337 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATGG TCGAGGAGCC GGCCCGTGAG GGGCTCGTGG ACATGGATCG GCTGCGCGCC GAGTTTCCGG TGCTCCGCCG GGAGGTCAAC GGACGGCCGC TGGTCTACCT CGACAGCGCG GCCAGTGCGC AGAAGCCGCA GTCGGTGATC GATGCCGAGC TCGATTGTTA TCAGCACTAC TACGCCAACG TGCACCGCGG GGTGCACACC CTCTCCCAGG AGGCCACCAC CGCCTTTGAA GGGGCGCGGA GCGAGGCCCA GCGGTTCCTC AATGCCCCCA GCGAGCGGGA GATTATCTTC CTGCGCGGGG TGACCGAGGC GATCAACCTG GTGGCCCACA GCTTTGTGGC GCCGCGGGTC GGTCCCGGTG ACGAGATCCT GGTCACCCAC ATGGAGCACC ACTCCAACAT CGTCCCCTGG CAGCTGGTCT GTGAGCGCAC CGGGGCGCAG CTGCGGGTGG TGCCCATCGA CGACAACGGG GATGTGGACC TGGAGACGGT GCGCGGCATG ATCCACGAGC GCACCCGGCT GGTCAGCGTG GTCCACGTCT CCAACGCCCT GGGGGCGGTC AATCCGGTGG CGGAGATCGC CGCCATGGCC CGAGCCCAAG GCGTGCCGGT GCTGCTCGAC GGCGCCCAGG CGGCCCCGCA TCTGCCGGTG GATATTCAGG AGCTGGGAGT CGACTTCTAC GCCTTTTCCG GGCACAAGGC CTACGGCCCG ACCGGGATCG GCGTCCTCTG GGGCCGCTAT GAGCACCTGG CGGGCATGGT GCCCTACCAG GGGGGCGGCG ACATGATCCG CCACGTCTCC TTCTCCGGCA CCGAGTACGC CGCGCCGCCG GCGCGTTTCG AGGCGGGGAC GCCGAATATC GCTGGCGCCA TCGGTCTGGG CGAGGCCCTG CGCTACATCG ACGCCATCGG CCGTGAGCGG ATCGCCGCCC GCGAAGAGGA CCTGGTCAAC CACGCCGCCG AGGCCATCGC CGCCGTGCCG GGGGTGCAGC TGATCGGCCG GCCCCAACGC CGGGCCGGTG CCGTGTCTTT CGTGATGGAA GGGACGCACC CCAACGATCT GGCCATGCTC CTCGATGAGC AGGGGATCGC GATCCGCGCC GGCCATCACT GCGCCCAGCC GGTGATGGAG CGCTTCGGTG TGCCCGCCAC GGCGCGCGCC TCGTTCGGGG TCTACAACAC CCACGATGAG GTGGAGAGCC TGGTGGTCGG CCTGGAGAAG ATCCGCCGCC TGTTCGGCTG A
|
Protein sequence | MSMVEEPARE GLVDMDRLRA EFPVLRREVN GRPLVYLDSA ASAQKPQSVI DAELDCYQHY YANVHRGVHT LSQEATTAFE GARSEAQRFL NAPSEREIIF LRGVTEAINL VAHSFVAPRV GPGDEILVTH MEHHSNIVPW QLVCERTGAQ LRVVPIDDNG DVDLETVRGM IHERTRLVSV VHVSNALGAV NPVAEIAAMA RAQGVPVLLD GAQAAPHLPV DIQELGVDFY AFSGHKAYGP TGIGVLWGRY EHLAGMVPYQ GGGDMIRHVS FSGTEYAAPP ARFEAGTPNI AGAIGLGEAL RYIDAIGRER IAAREEDLVN HAAEAIAAVP GVQLIGRPQR RAGAVSFVME GTHPNDLAML LDEQGIAIRA GHHCAQPVME RFGVPATARA SFGVYNTHDE VESLVVGLEK IRRLFG
|
| |