Gene Hlac_2248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2248 
Symbol 
ID7399958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2236834 
End bp2238111 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content67% 
IMG OID643709322 
Productcysteine desulfurase, SufS subfamily 
Protein accessionYP_002566895 
Protein GI222480658 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0748033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0430442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGTTC AGGAACAGTA CCCGTTCGAC GTCGAGGCCG TCCGCGCCGA CTTCCCGATC 
CTCGATCGGC TGGTCGGCGG CGATCCGGAG TCGCCCGGCG AGGGTCCCGG CGACGACACG
CCGCTCGTCT ATCTCGACAG CGCGGCGACC TCGCAAACGC CGGATCCGGT CGTCGACACG
ATTGTGGACT ACTACCGCGG CTACAACGCC AACGTCCACC GCGGGATCCA TCAGCTGAGT
CAGGAGGCCT CCGTCGCCTA CGAGGAGGCC CACGACACCG TCGCGGACTT CATCGGCGCG
TCGGGCCGCG AGGAGATCGT CTTCACGAAA AACACCACGG AGGCGATGAA CCTCGTCGCA
TACGCGTGGG GGCTCGAAGA ACTCGGGCCG GGCGACAACG TCGTCCTCTC GCAGATGGAA
CACCACGCGT CGCTGGTGAC GTGGCAGCAG ATCGGGAAGC GAACCGGCGC CGACGTGCGG
TTCATCGAGG TGACCGACGA GGGCCGGCTC GACATGGAAC ACGCCGCGGA GCTCATCGAC
GACGACACGC AGATGGTGTC GGTCGTCCAC GTCTCGAACA CGCTGGGCAC GATCAATCCG
ATCTCGGAGC TGGCCGACCT CGCGCACGAC CACGACGCGT ACGTCTTCGC CGACGGCGCG
CAGTCGGTGC CGACTCGGCC GGTCGACGTC GACGACCTCG GCGTGGACTT CCTCGCCTTT
TCCGGGCACA AGATGTGCGG CCCGACCGGT ATCGGGGCGC TGTACGGCCG CGAGGAGATC
CTCGACGAGG TGCAGCCGTA CCTCTACGGC GGTGACATGA TCCGACGCGT CTCCTTTACG
GACTCCACGT GGGAAGACCT CCCGTGGAAG TTCGAGGCCG GCACGCCTTC GATCGCGCAG
GGGATCGCCT TCGCGGCCGC GATCGAGTAT CTGGAAGAGA TCGGCATGCA GAACGTGCAG
GCCCACGAGG ATCTGCTGGC GGAGTACGCG TACGACGAGC TGACTGACCT CGGCGGCGTG
GAGATCTACG GGCCGCCGGG CAACGACCGC GGCGGTCTCG TCGCGTTCAA CGTCGAGGGC
GTCCACGCCC ACGATCTGTC CAGCATCCTC AACGACTACG GCGTCGCGAT CCGTGCCGGC
GACCACTGCA CCCAGCCACT CCACGACGAG CTTGGCGTCG CCGCCTCCGC GCGCGCCTCC
TTCTACCTCT ACAACACCGT CGAGGAGATC GACGCCTTGG TCGAGGCTGT CGGTGAGGCG
CGCGACCTGT TCGCGTAG
 
Protein sequence
MGVQEQYPFD VEAVRADFPI LDRLVGGDPE SPGEGPGDDT PLVYLDSAAT SQTPDPVVDT 
IVDYYRGYNA NVHRGIHQLS QEASVAYEEA HDTVADFIGA SGREEIVFTK NTTEAMNLVA
YAWGLEELGP GDNVVLSQME HHASLVTWQQ IGKRTGADVR FIEVTDEGRL DMEHAAELID
DDTQMVSVVH VSNTLGTINP ISELADLAHD HDAYVFADGA QSVPTRPVDV DDLGVDFLAF
SGHKMCGPTG IGALYGREEI LDEVQPYLYG GDMIRRVSFT DSTWEDLPWK FEAGTPSIAQ
GIAFAAAIEY LEEIGMQNVQ AHEDLLAEYA YDELTDLGGV EIYGPPGNDR GGLVAFNVEG
VHAHDLSSIL NDYGVAIRAG DHCTQPLHDE LGVAASARAS FYLYNTVEEI DALVEAVGEA
RDLFA