Gene Information |
Locus tag | Hhal_0556 |
Symbol | |
ID | 4710564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 631977 |
End bp | 632948 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639855014 |
Product | cysteine synthases |
Protein accession | YP_001002144 |
Protein GI | 121997357 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A |
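The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the 972 bp reported here) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Hhal_0556.
length = gene_length(631977, 632948)
print(length, protein_length(length))  # prints: 972 323
```

Both results agree with the Gene Length and Protein Length rows of the record.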
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.523099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGCGC CCGCGCCCGA AGGCGGGCGG ATCTACGACG ACTTCGTGGC CACCATCGGC GAGACCCCGC TGGTGCGGCT GTCGCGCCTG GGTGCCGAAG CCGGTTTGGC CGCAGAGCTG CTGGCCAAGC TGGAGATGTT CAATCCGCTG AGTTCGGTGA AGGATCGCAT CGCCCTGGCG ATGATCGAGG CCGCGGAAGC GGAGGGACGC ATCACGCCGG GGCGCTCGAC GCTCATCGAG CCTACCTCGG GGAACACCGG CATTGGCCTG GCCTTCATCG CCGCGGCCCG TGGGTACCGG TTGATCCTCA CCATGCCGGC GAACATGTCC GCCGAGCGTC GCAAGCTGTT TGCCTTCTAC GGTGCCCGGG TGGAACTGAC CGACCCGGCG GCGGGCATGC GCGGCGCCAT CGACCGGGCC GAGGCTCTGC TCGAGGAGAT CCCGGATAGC TTCACGCCGG CGCAATTCAG CAATCCGGCC AATCCCCGCG CACACCGCCT GCGTACCGCG GAGGAGATCT GGCGCGATAC CGGCGGGTCG GTGGACGGTC TGATCGCCGG GGTGGGGACC GGCGGGACCC TGACCGGGAT CGCCGGGGTG CTCAAGGAGC GCCGGCCCGG TTTCCGCGCC TACGCCGTGG AGCCGGCCGG GTCGCCGGTG CTCTCCGGTG GGGCGCCGGG GCCCCACGGT CTGCAGGGCA TCGGTGCCGG CTTTCTGCCG GATACCCTGG ATGCCACCCT GGTCGATGAG ACCCTGCAGG TGACGGACGA CGAGGCGCTG GCCATGAGCC GTCGGGTGGC GAGGCTGGAA GGGATCCCGT GCGGAATCTC CAGTGGGGCG GCGCTGTCGG CGGCGGTCCA GGTGGCCGCC CGGCCGGAGC TGGCCGGACA GCGTCTGGTG GTCATCCTCG CCTCTGGCGC CGAGCGCTAT CTATCCACAC CCCTGTTCGA GGGGCTGGAC GACGAGGCTT AG
|
Protein sequence | MEAPAPEGGR IYDDFVATIG ETPLVRLSRL GAEAGLAAEL LAKLEMFNPL SSVKDRIALA MIEAAEAEGR ITPGRSTLIE PTSGNTGIGL AFIAAARGYR LILTMPANMS AERRKLFAFY GARVELTDPA AGMRGAIDRA EALLEEIPDS FTPAQFSNPA NPRAHRLRTA EEIWRDTGGS VDGLIAGVGT GGTLTGIAGV LKERRPGFRA YAVEPAGSPV LSGGAPGPHG LQGIGAGFLP DTLDATLVDE TLQVTDDEAL AMSRRVARLE GIPCGISSGA ALSAAVQVAA RPELAGQRLV VILASGAERY LSTPLFEGLD DEA
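The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the method the database used. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and that the gene sequence shown is already in coding orientation even though the gene lies on the minus strand, since it begins with ATG and ends with TAG. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# assumed here to match translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def reverse_complement(seq: str) -> str:
    """Reverse complement, for sequences given on the opposite strand."""
    return clean(seq).translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence in this record.
print(translate("ATGGAGGCGC CCGCGCCCGA AGGCGGGCGG"))  # prints: MEAPAPEGGR
```

The ten residues printed match the first block of the protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence is the natural way to check the record against its 323 aa and 71% fields.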