Gene Htur_0224 details

Gene Information

Locus tag: Htur_0224
Symbol:
ID: 8740787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 245548
End bp: 246792
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 65%
IMG OID: 646510787
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003401798
Protein GI: 284163519
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
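
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates copied from the record for Htur_0224.
length = gene_length_bp(245548, 246792)
print(length, protein_length_aa(length))  # 1245 414
```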


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCAAC AGAACCTCGA GTCGCTCGAC GTCGGAGCGA TTCGCGACGA GTTCCCCATC 
CTCGAGCGCG AGTTCGACGG CCAGCAGGTC GTCTACCTCG ACAACGCGGC GACGACCCAG
ACTCCCGATC CGGTCGTCGA CGCGATGAGC GACTACTACC GCGAGTCGAA CGCGAACATC
CACCGGGGTA TTCACCACCT CAGTCAGGAG GCCTCCATCA TGTACGAGGA GGCCCACGAC
CGCGTCGCGG AGTTCATCAA CGCCGACGGC CGCGAGGAGG TCATCTTCAC CAAGAACACG
ACTGAGGGCG AGAACCTCAT CGCCTACGCG TGGGGCCTGA ACGAACTCGG CCCCGGCGAC
GAGATCGTCC TCACGGAGAT GGAACACCAC GCCTCGCTGG TCACGTGGCA ACAGATCGGC
AAGCGAACCG GCGCCGACGT GAAGTACATC CGGATCGACG AGGACCAGCG CCTCGACATG
GACCACGCCC GCGAACTGAT CACCGACGAC ACCGCCATCG TCTCGGCGGT CCACGTCTCG
AACACGCTGG GCACGGTCAA CCCCGTCTCC GAACTCACGG ATCTCGCCCA CGAGCACGAT
GCGCTCTCCT TCATCGACGG CGCGCAGGCA GTCCCTAACC GCCCCGTCGA CGTCAAGGCC
ATCGACGCCG ACTTCTACGC CTTCTCCGGC CACAAGATGG CCGGCCCCAC CGGGATCGGC
GTCCTCTACG GCAAGCAGCA CCTCCTCGAG GAGATGGAGC CGTACCTCTA CGGCGGCGGC
ATGATCCGGA AGGTCACCTA CGAGGACTCC ACGTGGGGCG ACCTCCCCTG GAAGTTCGAA
CCCGGAACGC CCCAGATCGC CGAGGCCGTC GGCCTCGAGG CCGCCATCGA CTGGCTCGAG
GACATCGGCA TGGAGCGCAT TCAGGCCCAC GAGGAGGAGA TCGCCCGCTA CGCTTACGAG
CGACTCGAGA GCGAGGAGGA CGTCGAGATC TACGGCCCAG AGCCGGGCCC CGACCGTGGC
GGTCTCGTCA GCTTCAACGT CGAGGGCGTC CACGCCCACG ACCTGGCCTC GATCATGAAC
GACCACACGA TCGCGGTCCG GGCCGGCGAC CACTGTACCC AGCCGCTCCA CGACAAGCTC
GGCGTGCCGG CCTCGACTCG AGCGTCGTTC TACGTCTACA ACACGCGAGA GGAAGTCGAC
AAGTTGGTCG CGGCGCTCGA CGACGCGCGA CAGCTGTTCG CGTAA
 
Protein sequence
MSQQNLESLD VGAIRDEFPI LEREFDGQQV VYLDNAATTQ TPDPVVDAMS DYYRESNANI 
HRGIHHLSQE ASIMYEEAHD RVAEFINADG REEVIFTKNT TEGENLIAYA WGLNELGPGD
EIVLTEMEHH ASLVTWQQIG KRTGADVKYI RIDEDQRLDM DHARELITDD TAIVSAVHVS
NTLGTVNPVS ELTDLAHEHD ALSFIDGAQA VPNRPVDVKA IDADFYAFSG HKMAGPTGIG
VLYGKQHLLE EMEPYLYGGG MIRKVTYEDS TWGDLPWKFE PGTPQIAEAV GLEAAIDWLE
DIGMERIQAH EEEIARYAYE RLESEEDVEI YGPEPGPDRG GLVSFNVEGV HAHDLASIMN
DHTIAVRAGD HCTQPLHDKL GVPASTRASF YVYNTREEVD KLVAALDDAR QLFA
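
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGTCAAC AGAACCTCGA GTCGCTCGAC"))  # MSQQNLESLD
```

Applied to the final line of the gene sequence, the same function yields the last residues of the protein and stops at the terminal TAA codon.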