Gene Information |
Locus tag | Hhal_1694 |
Symbol | |
ID | 4710048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1850260 |
End bp | 1851255 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639856161 |
Product | cysteine synthase A |
Protein accession | YP_001003260 |
Protein GI | 121998473 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | |
|
|
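The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1850260
END_BP = 1851255
REPORTED_GENE_LENGTH = 996       # bp
REPORTED_PROTEIN_LENGTH = 331    # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 996 331
```

Under these assumptions both derived values agree with the reported 996 bp and 331 aa.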
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGCA AGCGGGAGTC GAGCATGTCG ATCTGCGATG GATTCGTCGG CGCGATCGGG AACACGCCGC TGATTCGCCT GCCGCGGCTG AGCGAGGAGA CCGGCTGCGA GATCCTCGGC AAGGCCGAGT TCATGAACCC CGGCGGGTCT GTGAAGGATC GGGCCGCGCT GGCCATCGTC CAGGCCGCGG AACGCAGTGG CGAGCTGCGT CCTGGTGGAA CGGTGGTGGA GGGCACCGCC GGCAACACCG GCATCGGGTT GGCCCATATC TGTAATGCGC GCGGCTACCG CTGCGTGATT GTGATCCCGG AGACCCAGAG CCAGGAGAAG ATCGACCTGC TCCGCACCCT CGGTGCCGAG GTCCATGCGG TTCCTGCCGC GCCCTACCGC GACCCGGGGA ACTACCAGAA GGTGGCCGGC CGCATGGCGG CGGAGATGGA CAACGCGGTC TGGGCCAACC AGTTCGATAA CACCGCCAAC CGCCTTGGCC ACTACCGCAC CACTGGGCCG GAGGTCTGGG CGCAGACCGG CGGTCGGGTC GACGCGTTCG TTGCCGCCAC CGGCACGGGC GGCACGCTGG CCGGGGTCTC TCGGGCGCTC AAGGAGCGCT CCCCGGATAC CCGCATCTAC CTGGCCGACC CTTCGGGCAG CGCCCTCTAT AACTTCGTGC GCGATGGCGA GCCGGCGCCC ACTGCGGGCA ATTCCATCAC CGAAGGGATT GGCAGCAGCC GTGTAACAGC TAATCTTCAG GGTACGGACA TCGATGACGC GTTTTGCATA TCGGACGCCG AATCGGTGCC GATGGTCTAC CGGCTCCTGA GGGAAGAGGG GCTATTCCTG GGCAGTTCTT CGGGGGTCAA TGTGTGCGGG GCCGTGCGTG CCGCCGAGGA ATTGGGGCCC GGGCACACGG TGGTGACGAT CCTCTGCGAT GGAGGGGGCC GCTATTACTC GAGACTCTTC AATGAAGCCT GGCTGGCGGA GCGGGGACTC GCCTGA
|
Protein sequence | MSSKRESSMS ICDGFVGAIG NTPLIRLPRL SEETGCEILG KAEFMNPGGS VKDRAALAIV QAAERSGELR PGGTVVEGTA GNTGIGLAHI CNARGYRCVI VIPETQSQEK IDLLRTLGAE VHAVPAAPYR DPGNYQKVAG RMAAEMDNAV WANQFDNTAN RLGHYRTTGP EVWAQTGGRV DAFVAATGTG GTLAGVSRAL KERSPDTRIY LADPSGSALY NFVRDGEPAP TAGNSITEGI GSSRVTANLQ GTDIDDAFCI SDAESVPMVY RLLREEGLFL GSSSGVNVCG AVRAAEELGP GHTVVTILCD GGGRYYSRLF NEAWLAERGL A
|
| |
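The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is not the method used to produce this record. It assumes the gene sequence is listed in coding orientation (as shown, it begins with ATG and ends with TGA) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
prefix = "ATGTCAAGCA AGCGGGAGTC GAGCATGTCG"
print(translate(prefix))  # MSSKRESSMS
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the reported 66%.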