Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0509 |
Symbol | |
ID | 8382776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 516216 |
End bp | 517124 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644971571 |
Product | cysteine synthase |
Protein accession | YP_003129429 |
Protein GI | 257051596 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.445859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATA GCATTCTCGA GACGATCGGC TCGCCGCTGG TCGAGATCGA CTCGCCACCG GGCGCGACCG TCGCCGCCAA AGTCGAATCG TTCAATCCCG GGGGGTCGGC GAAGGACCGT CCCGCACTCG GGATGGTCGA GGCCGCCGAA CGCGACGGTG ACCTCTCTCC AGGTGACCGG ATCGTCGAGC CGACCAGCGG GAACACCGGG ATCGGCATCT CGCTGGTCGC GGCCGCGAAG GGGTACGACG TGACCATCGT CATGCCAGCG GACATGTCCG TCGAGCGCCG CCGGCTAATG GAAGCCTACG GGGCGGATCT GGAACTGATC GAGGGCGACA TGACCGACGC CCGGGATCGG GCTGATGCAC TCGAAGCCGA TGAAGGGATG GTCCAGCTCC GGCAGTTCGA GAACCCAGCG AACCCCCAGG CTCATTACGA GACAACGGGG CCCGAAATTC TCGAACAGGT TGGTGATCGG GAAATCGACG CGTTCGTCGC CGGCGTGGGC ACTGGCGGGA CGATCAGCGG GACGGCCCGG CGGCTTCGTG AGGCATTCCA TGATGTGGAC GTGATCGGCG TCGAACCCGC CGAGAATGCG GTCCTCTCGA CGGGCGAATC GGGCTCGGAC GACTTCCAGG GCATGGGGCC GGGGTTCGTC AGCGACAACC TGGATCGGGA CGTGATCGAC GAGGTCCGGA CGATCGAACT CGCGGATGCA GAAGCGGAAT GTCGGCGCCT TGCCCGTGCG GAAGGATTGC TCGTCGGCCA ATCGAGCGGT GCAATGGGTG TGATTGCCCG GGAAGTAGCC GCGGAACGGG CTGCTCCCGA CGCCGAGGAA CCACCGCTGA TCGTGACCGT CTTCTGGGAC AGTGGCGAGC GGTATCTTTC AACCGGATTG TTCGATTGA
|
Protein sequence | MDDSILETIG SPLVEIDSPP GATVAAKVES FNPGGSAKDR PALGMVEAAE RDGDLSPGDR IVEPTSGNTG IGISLVAAAK GYDVTIVMPA DMSVERRRLM EAYGADLELI EGDMTDARDR ADALEADEGM VQLRQFENPA NPQAHYETTG PEILEQVGDR EIDAFVAGVG TGGTISGTAR RLREAFHDVD VIGVEPAENA VLSTGESGSD DFQGMGPGFV SDNLDRDVID EVRTIELADA EAECRRLARA EGLLVGQSSG AMGVIAREVA AERAAPDAEE PPLIVTVFWD SGERYLSTGL FD
|
| |