Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0783 |
Symbol | |
ID | 8741366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 801666 |
End bp | 802613 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646511361 |
Product | cysteine synthase A |
Protein accession | YP_003402352 |
Protein GI | 284164073 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACCAG CAGTCGACTC CGCGGCCGAG ACCGAGATCG ACGTCGCCGA GACCGTCGAC GAACTGATCG GCCGAACGCC GCTGTTGCGA CTGGACGCGT TCGCCGACAA CTGTTTCGGA AAACTCGAGT CGCACAACCC CTATTCGGTC AAGGATCGGA TCGCGCGCGG GATCATCGAC GCCGCCGAGC GGGCGGGCGC GCTCGAGCCC GACGACACCG TCGTCGAATC GACCAGCGGG AACACGGGCA TCGGGCTGGC GGCGGTCTGT GCCGCTCGCG GCTACGACTG CGTGCTGACG ATGCCGGCCT CGATGTCGAC CGAGCGTCGG CAACTCCTGA GCGCGTTGGG CGCCGACCTC GAGTTGACGC CCGCCGAGGA CGGGATGGGC GGCGCGAACG AGCGCGCCGA GGAGATCGTC GCCGAGCGCG AGGACGCGAT CATGGCCCGC CAGTTCGAGA ACGAGGCGAA CCCGGCGGCC CACCGGGAGA CGACCGGGCC CGAAATCTGG GACGCCACAG ACGGCGCGGT CGACGCGGTC GTCGCGGGCG TCGGTACCGG CGGTACCATC ACCGGCGTCT CGGAGTACAT CAAGGAAGAA CGGGGGAAGA CCGATCTCAC GTCGGTCGCG GTCGAACCGG CCGAATCGCC GACGCTCTCG GAACTCAGTT CCGAGGGCCA CGACATTCAG GGGATCGGTC CCGGCTTCGT CCCCGACATC CTGCGGACCG AACTGATCGA CGAGACCCGC GCCGTCGAGG GTGACGCAGC GAAGGAAGCA TCCCGAAAGC TGGGCCGAAC CGAGGGACTG CTGATCGGCA TCTCCGCGGG CGCGGCGCTG TCGGCCGCGG CCGACTATGC GGCCGAACAC CCCGACGAAC TGGTCGTCGC CGTCCTTCCC GATACCGGTG AGCGGTACCT CTCGACCGAT CTCTACGAAC GGGACTAG
|
Protein sequence | MGPAVDSAAE TEIDVAETVD ELIGRTPLLR LDAFADNCFG KLESHNPYSV KDRIARGIID AAERAGALEP DDTVVESTSG NTGIGLAAVC AARGYDCVLT MPASMSTERR QLLSALGADL ELTPAEDGMG GANERAEEIV AEREDAIMAR QFENEANPAA HRETTGPEIW DATDGAVDAV VAGVGTGGTI TGVSEYIKEE RGKTDLTSVA VEPAESPTLS ELSSEGHDIQ GIGPGFVPDI LRTELIDETR AVEGDAAKEA SRKLGRTEGL LIGISAGAAL SAAADYAAEH PDELVVAVLP DTGERYLSTD LYERD
|
| |