Gene HS_0281 details


Gene Information

Locus tag: HS_0281
Symbol: nifS
ID: 4239483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 280926
End bp: 282140
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 39%
IMG OID: 638103821
Product: cysteine desulfurase
Protein accession: YP_718489
Protein GI: 113460427
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR02006] cysteine desulfurase IscS; [TIGR03402] cysteine desulfurase NifS
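
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the reported gene length includes the stop codon; the function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a CDS whose last
    codon is a stop codon (an assumption, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(280926, 282140)
print(gene_len, protein_len)  # 1215 404
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of this record.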


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.910551
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAC CTATTTATTT AGATTATGCG GCAACTTGCC CTGTGGATGA ACGTGTAGTG 
AAGAAAATGA TGGAGTTTTT AAGCATTGAT GGAAATTTTG GAAACCCAGC ATCTCGCTCA
CATAAATTTG GTTGGCAGGC TGAAGAAGCG GTTGATGTTG CTCGCAACCA TATTGCTGAT
TTAATCGGTG CGGACAGTCG TGAGATTGTT TTTACTTCTG GTGCAACAGA AGCGGACAAC
CTTGCATTGA AAGGAGTGAT GCGTTTCTAT CAAACTAAAG GGAAACATCT TATTACCTGT
AAAACTGAAC ATAAAGCGAT CTTAGACACT TGTCGCCAGC TTGAACGTGA AGGGTTTGAA
GTGACTTATT TAGATCCCAA ATCAGATGGT TTGATAGATT TAGAGGAATT AAAATCCGTT
ATTCGTGATG ACACGGTTTT GGTTTCTATA ATGCACGCCA ATAATGAGAT TGGTGTAGTA
CAGGATATTG CTAAAATTGG TGAAATTTGC CGTGAACGTA AGGTATTATT TCACACCGAT
GCAACTCAAT CCGTCGGTAA ATTGCCCATT AATTTGAGTG AATTAAAAGT AGATTTACTT
TCTATGTCCA GTCATAAATT ATATGGACCT AAAGGTATAG GTGCTTTATA TGTTTGCCGT
AAACCACGAG TTCGTTTGGA GGCAATTATT CATGGCGGCG GTCATGAGCG TGGTATGCGT
TCAGGAACCT TACCTGTACA TCAGATTGTG GGCATGGGTG AAGCATATCG TATTGCTAAA
GAAGAAATGG CAACAGAAAT GCCACGTTTA ACCGCTTTGC GTGATCGTTT ATATAACGGC
TTGAAAGATA TAGAAGAAAC CTATGTAAAC GGTTCAATGG AACAACGTTT GGGGAATAAT
TTAAATATTA GTTTTAATTA CGTTGAAGGT GAAAGTTTAA TGATGGCGTT ACGTGATATT
GCGGTGTCTT CCGGTTCTGC CTGTACATCA GCAAGCCTTG AACCTTCTTA TGTGTTGCGT
GCATTAGGAC TGAATGACGA ATTGGCACAC AGCTCAATTC GTTTTACTGT GGGGCGTTAT
ACTACCGCAG AAGAAATTGA TTATGCTATT GGCTTAGTTA AAAGTGCGGT GGAAAAGTTA
CGTGATTTAT CTCCACTTTG GGATATGTTC AAAGAGGGTA TTGATTTAAA CAGTATTGAA
TGGACTCATC ATTAA
 
Protein sequence
MKLPIYLDYA ATCPVDERVV KKMMEFLSID GNFGNPASRS HKFGWQAEEA VDVARNHIAD 
LIGADSREIV FTSGATEADN LALKGVMRFY QTKGKHLITC KTEHKAILDT CRQLEREGFE
VTYLDPKSDG LIDLEELKSV IRDDTVLVSI MHANNEIGVV QDIAKIGEIC RERKVLFHTD
ATQSVGKLPI NLSELKVDLL SMSSHKLYGP KGIGALYVCR KPRVRLEAII HGGGHERGMR
SGTLPVHQIV GMGEAYRIAK EEMATEMPRL TALRDRLYNG LKDIEETYVN GSMEQRLGNN
LNISFNYVEG ESLMMALRDI AVSSGSACTS ASLEPSYVLR ALGLNDELAH SSIRFTVGRY
TTAEEIDYAI GLVKSAVEKL RDLSPLWDMF KEGIDLNSIE WTHH
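
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (clean, translate, gc_percent) are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares with table 1 for elongation codons. The two tables differ in which codons may act as start codons, and that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest)
# paired with the amino acids of the standard code (tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAATTAC CTATTTATTT AGATTATGCG"))  # MKLPIYLDYA
```

Passing the full gene sequence to translate and gc_percent allows comparison against the Protein sequence and GC content fields above.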