Gene Huta_1228 details


Gene Information

Locus tag: Huta_1228
Symbol:
ID: 8383503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1199087
End bp: 1200334
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 64%
IMG OID: 644972287
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003130137
Protein GI: 257052304
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
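
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a complete CDS whose terminal stop codon is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Huta_1228.
length = gene_length(1199087, 1200334)
print(length)                  # 1248
print(protein_length(length))  # 415
```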


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGCAC GTGAAACGGG AGACCTCGAC GTCTCCGCTA TCCGCGAGGA TTTCCCCATC 
CTGGAGCGGG AGTTCGACGG GACGCCGCTG GTCTATCTCG ACAACGCGGC GACGACACAG
ACGCCCCAGC GGGTCATCGA CGCCATCAGC GAGTACTACG AGACCTACAA CGCAAACGTT
CATCGGGGCC TCCACCACCT CAGCCAGGAA GCCAGTGTGG CCTACGAGGA GGCCCACGAT
CGGATGGCCG AGTTCATCGG TGCGAGCGGC GGTCGCGAGG AGTTGATCTT CACCGGCAAC
ACGACCGAAT CGGAGAATCT GGTGGCCTAC GCCTGGGGAC TGAACGAACT CGGTCCCGGC
GACGAGGTCG TCCTGACCGA GATGGAGCAC CACGCCTCGC TGGTGACTTG GCAACAGATC
GCCAAGCGGA CCGGCGCGAC AGTCCGGTAC ATCCGGGTCG ACGAGGACGG GCACCTCGAC
ATGGACCACG CCACGGAACT CATCGGCCCG GATACGGCCA TGGTCTCGGT GGTCCACGTC
TCGAACACGC TCGGGACGAT CAACCCCGTC GCCGAATTGG CCGACCTCGC CCACGCCGAG
GACGCCTTCA TCTTCGTCGA CGGTGCCCAG GCCGTCCCCA ACCGGCCGGT CGATGTCGAA
GCGATCGACG CTGACTTCTA CGCCTTTTCG GGCCACAAGA TGGCCGGCCC GACCGGGATC
GGTGCGCTCT ACGGCAAGCA AGCGATCCTT GAACACATGG AGCCGTTCAA CTACGGCGGC
GACATGATCA CGAAGGTCAC CTACGAGGAC GCGACCTGGA ACGAACTGCC CTGGAAGTTC
GAGGCCGGGA CACCCAAGAT TGCCCAGGGG ATCGCACTGG CGGAAGCCGC CGACTACCTC
GACGAGATCG GGCTGGACGC CATCGCCCGC CACGAGAACG AACTCGCCCA GTACGCCATC
GACCGGCTGA GCGAGTTCGA CGACATCGAG ATCTACGGTC CGTCCGCAGG CGAGGAGCGG
GGTGGTCTGG TCTCGTTCAA TCTGGAATCA GTCCACGCCC ACGACCTCTC CTCTATCCTG
AACGACTACG CCGTCGCGAT CCGGGCCGGC GATCACTGCA CCCAGCCGCT GCACGATAAG
CTGGGCGTGG CTGCGTCTGC GCGCGCGTCG TTTTATCTTT ACAATACCCG CGACGAGATT
GACGTGCTGA TCGACGCCAT TGACGACGCT CGCCAGTTGT TCGGCTGA
 
Protein sequence
MTARETGDLD VSAIREDFPI LEREFDGTPL VYLDNAATTQ TPQRVIDAIS EYYETYNANV 
HRGLHHLSQE ASVAYEEAHD RMAEFIGASG GREELIFTGN TTESENLVAY AWGLNELGPG
DEVVLTEMEH HASLVTWQQI AKRTGATVRY IRVDEDGHLD MDHATELIGP DTAMVSVVHV
SNTLGTINPV AELADLAHAE DAFIFVDGAQ AVPNRPVDVE AIDADFYAFS GHKMAGPTGI
GALYGKQAIL EHMEPFNYGG DMITKVTYED ATWNELPWKF EAGTPKIAQG IALAEAADYL
DEIGLDAIAR HENELAQYAI DRLSEFDDIE IYGPSAGEER GGLVSFNLES VHAHDLSSIL
NDYAVAIRAG DHCTQPLHDK LGVAASARAS FYLYNTRDEI DVLIDAIDDA RQLFG
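
Both sequences are printed in blocks of 10 characters, up to 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a simple GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and do not come from the source database.

```python
def clean_sequence(block_text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence listing, as printed in the record.
first_line = "ATGACGGCAC GTGAAACGGG AGACCTCGAC GTCTCCGCTA TCCGCGAGGA TTTCCCCATC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applied to the full listings, the cleaned gene and protein sequences should have lengths matching the Gene Length and Protein Length fields (1248 bp and 415 aa).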