Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1228 |
Symbol | |
ID | 8383503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1199087 |
End bp | 1200334 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644972287 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003130137 |
Protein GI | 257052304 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCAC GTGAAACGGG AGACCTCGAC GTCTCCGCTA TCCGCGAGGA TTTCCCCATC CTGGAGCGGG AGTTCGACGG GACGCCGCTG GTCTATCTCG ACAACGCGGC GACGACACAG ACGCCCCAGC GGGTCATCGA CGCCATCAGC GAGTACTACG AGACCTACAA CGCAAACGTT CATCGGGGCC TCCACCACCT CAGCCAGGAA GCCAGTGTGG CCTACGAGGA GGCCCACGAT CGGATGGCCG AGTTCATCGG TGCGAGCGGC GGTCGCGAGG AGTTGATCTT CACCGGCAAC ACGACCGAAT CGGAGAATCT GGTGGCCTAC GCCTGGGGAC TGAACGAACT CGGTCCCGGC GACGAGGTCG TCCTGACCGA GATGGAGCAC CACGCCTCGC TGGTGACTTG GCAACAGATC GCCAAGCGGA CCGGCGCGAC AGTCCGGTAC ATCCGGGTCG ACGAGGACGG GCACCTCGAC ATGGACCACG CCACGGAACT CATCGGCCCG GATACGGCCA TGGTCTCGGT GGTCCACGTC TCGAACACGC TCGGGACGAT CAACCCCGTC GCCGAATTGG CCGACCTCGC CCACGCCGAG GACGCCTTCA TCTTCGTCGA CGGTGCCCAG GCCGTCCCCA ACCGGCCGGT CGATGTCGAA GCGATCGACG CTGACTTCTA CGCCTTTTCG GGCCACAAGA TGGCCGGCCC GACCGGGATC GGTGCGCTCT ACGGCAAGCA AGCGATCCTT GAACACATGG AGCCGTTCAA CTACGGCGGC GACATGATCA CGAAGGTCAC CTACGAGGAC GCGACCTGGA ACGAACTGCC CTGGAAGTTC GAGGCCGGGA CACCCAAGAT TGCCCAGGGG ATCGCACTGG CGGAAGCCGC CGACTACCTC GACGAGATCG GGCTGGACGC CATCGCCCGC CACGAGAACG AACTCGCCCA GTACGCCATC GACCGGCTGA GCGAGTTCGA CGACATCGAG ATCTACGGTC CGTCCGCAGG CGAGGAGCGG GGTGGTCTGG TCTCGTTCAA TCTGGAATCA GTCCACGCCC ACGACCTCTC CTCTATCCTG AACGACTACG CCGTCGCGAT CCGGGCCGGC GATCACTGCA CCCAGCCGCT GCACGATAAG CTGGGCGTGG CTGCGTCTGC GCGCGCGTCG TTTTATCTTT ACAATACCCG CGACGAGATT GACGTGCTGA TCGACGCCAT TGACGACGCT CGCCAGTTGT TCGGCTGA
|
Protein sequence | MTARETGDLD VSAIREDFPI LEREFDGTPL VYLDNAATTQ TPQRVIDAIS EYYETYNANV HRGLHHLSQE ASVAYEEAHD RMAEFIGASG GREELIFTGN TTESENLVAY AWGLNELGPG DEVVLTEMEH HASLVTWQQI AKRTGATVRY IRVDEDGHLD MDHATELIGP DTAMVSVVHV SNTLGTINPV AELADLAHAE DAFIFVDGAQ AVPNRPVDVE AIDADFYAFS GHKMAGPTGI GALYGKQAIL EHMEPFNYGG DMITKVTYED ATWNELPWKF EAGTPKIAQG IALAEAADYL DEIGLDAIAR HENELAQYAI DRLSEFDDIE IYGPSAGEER GGLVSFNLES VHAHDLSSIL NDYAVAIRAG DHCTQPLHDK LGVAASARAS FYLYNTRDEI DVLIDAIDDA RQLFG
|
| |