Gene Information |
Locus tag | Haur_3334 |
Symbol | |
ID | 5735204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4202511 |
End bp | 4203440 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280481 |
Product | cysteine synthase A |
Protein accession | YP_001546098 |
Protein GI | 159899851 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A |
|
|
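The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4202511
end_bp = 4203440

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# that is not part of the translated protein.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (930 bp) and Protein Length (309 aa) rows.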
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.229416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGGA TTTATGACAA CATTACCCAA CTTATCGGCA ACACGCCGCT GGTGCGCTTG GGCAAAGTTA ATACCACGGG GGCGACGGTA TTAGCTAAGC TTGAGTTCTT CAACCCAGCC AGCAGCGTCA AAGATCGGAT TGGCTTGGCG ATGATCGAGG CAGCGGAGGC CGCCGGTCTG ATCAATCCAA ACGATACCAC GATTATCGAG CCAACCAGTG GCAATACGGG GATTGGCCTC GCATTTGTGG CTGCCGCGAA AGGCTACCGC ATTGTGCTGA CCATGCCCGA AACCATGAGC TTGGAACGGC GCAAATTACT CAAAGGCTTT GGAGCCGAAT TAGTTTTGAC TCCAGGCTCC GAGGGCATGC CTGGGGCAAT TCGTCGTGCC GAAGAATTGG CCGCCGAAAA TCCAGGCAGC TTTATTCCGC AACAATTCAA AAATAAAGCC AACCCAGCGG TTCACCAACG CACCACCGCT GAAGAAATTT GGAATGATAC CGATGGCGCT GTCGATATTT TGGTGGCCGG GGTTGGGACT GGTGGCACAA TCACTGGGGT TGCTTCGGTG CTCAAAGAAC GCAAGCCAGG CTTTAAGGCA ATTGCGGTCG AGCCAACTGC CTCGCCAGTC CTTTCTGGGG GCAAAATGGG GCCGCACAAA ATCCAAGGGA TCGGTGCTGG CTTTGTGCCC GACGTGCTTG ATACCAGCGT AATTGATGAA ATTATTCAAG TTACCAACGA ACATGCCTTT GAATGGGCAC GCAAATTGGC TCACGAAGAA GGCTTGATGG TGGGAATTAG CTCAGGGGCC GCCGCATGGG CCGCCTTGCA AGTAGCAGCC CGCCCTGAAA ATGCTGGCAA AACGATTGTC TTTATTGTGC CAAGCAACGG CGAACGCTAC CTGAGCACGC CATTGTTTGA TGCTGAGTAG
|
Protein sequence | MARIYDNITQ LIGNTPLVRL GKVNTTGATV LAKLEFFNPA SSVKDRIGLA MIEAAEAAGL INPNDTTIIE PTSGNTGIGL AFVAAAKGYR IVLTMPETMS LERRKLLKGF GAELVLTPGS EGMPGAIRRA EELAAENPGS FIPQQFKNKA NPAVHQRTTA EEIWNDTDGA VDILVAGVGT GGTITGVASV LKERKPGFKA IAVEPTASPV LSGGKMGPHK IQGIGAGFVP DVLDTSVIDE IIQVTNEHAF EWARKLAHEE GLMVGISSGA AAWAALQVAA RPENAGKTIV FIVPSNGERY LSTPLFDAE
|
| |
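The gene sequence is shown in space-separated blocks of 10 bases and is already in coding orientation (it begins with ATG and ends with TAG), even though the gene lies on the minus strand. As a rough sketch of how the two sequence rows relate, the code below joins the blocks and translates the first 30 bases. It assumes that the amino acid assignments of translation table 11 match the standard genetic code; the helper names `join_blocks` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(blocked.split()).upper()

def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks and last block of the gene sequence, copied from the record.
first_blocks = "ATGGCGCGGA TTTATGACAA CATTACCCAA"
last_block = "TGCTGAGTAG"

n_terminus = translate(join_blocks(first_blocks))
stop_codon = join_blocks(last_block)[-3:]

print(n_terminus)   # first 10 residues
print(stop_codon)   # final codon of the gene
```

The translated residues correspond to the first block of the protein sequence (MARIYDNITQ), and the final codon TAG is a stop codon, which is why the protein is one residue shorter than the codon count.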