Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0224 |
Symbol | |
ID | 8740787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 245548 |
End bp | 246792 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646510787 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003401798 |
Protein GI | 284163519 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAC AGAACCTCGA GTCGCTCGAC GTCGGAGCGA TTCGCGACGA GTTCCCCATC CTCGAGCGCG AGTTCGACGG CCAGCAGGTC GTCTACCTCG ACAACGCGGC GACGACCCAG ACTCCCGATC CGGTCGTCGA CGCGATGAGC GACTACTACC GCGAGTCGAA CGCGAACATC CACCGGGGTA TTCACCACCT CAGTCAGGAG GCCTCCATCA TGTACGAGGA GGCCCACGAC CGCGTCGCGG AGTTCATCAA CGCCGACGGC CGCGAGGAGG TCATCTTCAC CAAGAACACG ACTGAGGGCG AGAACCTCAT CGCCTACGCG TGGGGCCTGA ACGAACTCGG CCCCGGCGAC GAGATCGTCC TCACGGAGAT GGAACACCAC GCCTCGCTGG TCACGTGGCA ACAGATCGGC AAGCGAACCG GCGCCGACGT GAAGTACATC CGGATCGACG AGGACCAGCG CCTCGACATG GACCACGCCC GCGAACTGAT CACCGACGAC ACCGCCATCG TCTCGGCGGT CCACGTCTCG AACACGCTGG GCACGGTCAA CCCCGTCTCC GAACTCACGG ATCTCGCCCA CGAGCACGAT GCGCTCTCCT TCATCGACGG CGCGCAGGCA GTCCCTAACC GCCCCGTCGA CGTCAAGGCC ATCGACGCCG ACTTCTACGC CTTCTCCGGC CACAAGATGG CCGGCCCCAC CGGGATCGGC GTCCTCTACG GCAAGCAGCA CCTCCTCGAG GAGATGGAGC CGTACCTCTA CGGCGGCGGC ATGATCCGGA AGGTCACCTA CGAGGACTCC ACGTGGGGCG ACCTCCCCTG GAAGTTCGAA CCCGGAACGC CCCAGATCGC CGAGGCCGTC GGCCTCGAGG CCGCCATCGA CTGGCTCGAG GACATCGGCA TGGAGCGCAT TCAGGCCCAC GAGGAGGAGA TCGCCCGCTA CGCTTACGAG CGACTCGAGA GCGAGGAGGA CGTCGAGATC TACGGCCCAG AGCCGGGCCC CGACCGTGGC GGTCTCGTCA GCTTCAACGT CGAGGGCGTC CACGCCCACG ACCTGGCCTC GATCATGAAC GACCACACGA TCGCGGTCCG GGCCGGCGAC CACTGTACCC AGCCGCTCCA CGACAAGCTC GGCGTGCCGG CCTCGACTCG AGCGTCGTTC TACGTCTACA ACACGCGAGA GGAAGTCGAC AAGTTGGTCG CGGCGCTCGA CGACGCGCGA CAGCTGTTCG CGTAA
|
Protein sequence | MSQQNLESLD VGAIRDEFPI LEREFDGQQV VYLDNAATTQ TPDPVVDAMS DYYRESNANI HRGIHHLSQE ASIMYEEAHD RVAEFINADG REEVIFTKNT TEGENLIAYA WGLNELGPGD EIVLTEMEHH ASLVTWQQIG KRTGADVKYI RIDEDQRLDM DHARELITDD TAIVSAVHVS NTLGTVNPVS ELTDLAHEHD ALSFIDGAQA VPNRPVDVKA IDADFYAFSG HKMAGPTGIG VLYGKQHLLE EMEPYLYGGG MIRKVTYEDS TWGDLPWKFE PGTPQIAEAV GLEAAIDWLE DIGMERIQAH EEEIARYAYE RLESEEDVEI YGPEPGPDRG GLVSFNVEGV HAHDLASIMN DHTIAVRAGD HCTQPLHDKL GVPASTRASF YVYNTREEVD KLVAALDDAR QLFA
|
| |