Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0281 |
Symbol | nifS |
ID | 4239483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 280926 |
End bp | 282140 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638103821 |
Product | cysteine desulfurase |
Protein accession | YP_718489 |
Protein GI | 113460427 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR02006] cysteine desulfurase IscS [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.910551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTAC CTATTTATTT AGATTATGCG GCAACTTGCC CTGTGGATGA ACGTGTAGTG AAGAAAATGA TGGAGTTTTT AAGCATTGAT GGAAATTTTG GAAACCCAGC ATCTCGCTCA CATAAATTTG GTTGGCAGGC TGAAGAAGCG GTTGATGTTG CTCGCAACCA TATTGCTGAT TTAATCGGTG CGGACAGTCG TGAGATTGTT TTTACTTCTG GTGCAACAGA AGCGGACAAC CTTGCATTGA AAGGAGTGAT GCGTTTCTAT CAAACTAAAG GGAAACATCT TATTACCTGT AAAACTGAAC ATAAAGCGAT CTTAGACACT TGTCGCCAGC TTGAACGTGA AGGGTTTGAA GTGACTTATT TAGATCCCAA ATCAGATGGT TTGATAGATT TAGAGGAATT AAAATCCGTT ATTCGTGATG ACACGGTTTT GGTTTCTATA ATGCACGCCA ATAATGAGAT TGGTGTAGTA CAGGATATTG CTAAAATTGG TGAAATTTGC CGTGAACGTA AGGTATTATT TCACACCGAT GCAACTCAAT CCGTCGGTAA ATTGCCCATT AATTTGAGTG AATTAAAAGT AGATTTACTT TCTATGTCCA GTCATAAATT ATATGGACCT AAAGGTATAG GTGCTTTATA TGTTTGCCGT AAACCACGAG TTCGTTTGGA GGCAATTATT CATGGCGGCG GTCATGAGCG TGGTATGCGT TCAGGAACCT TACCTGTACA TCAGATTGTG GGCATGGGTG AAGCATATCG TATTGCTAAA GAAGAAATGG CAACAGAAAT GCCACGTTTA ACCGCTTTGC GTGATCGTTT ATATAACGGC TTGAAAGATA TAGAAGAAAC CTATGTAAAC GGTTCAATGG AACAACGTTT GGGGAATAAT TTAAATATTA GTTTTAATTA CGTTGAAGGT GAAAGTTTAA TGATGGCGTT ACGTGATATT GCGGTGTCTT CCGGTTCTGC CTGTACATCA GCAAGCCTTG AACCTTCTTA TGTGTTGCGT GCATTAGGAC TGAATGACGA ATTGGCACAC AGCTCAATTC GTTTTACTGT GGGGCGTTAT ACTACCGCAG AAGAAATTGA TTATGCTATT GGCTTAGTTA AAAGTGCGGT GGAAAAGTTA CGTGATTTAT CTCCACTTTG GGATATGTTC AAAGAGGGTA TTGATTTAAA CAGTATTGAA TGGACTCATC ATTAA
|
Protein sequence | MKLPIYLDYA ATCPVDERVV KKMMEFLSID GNFGNPASRS HKFGWQAEEA VDVARNHIAD LIGADSREIV FTSGATEADN LALKGVMRFY QTKGKHLITC KTEHKAILDT CRQLEREGFE VTYLDPKSDG LIDLEELKSV IRDDTVLVSI MHANNEIGVV QDIAKIGEIC RERKVLFHTD ATQSVGKLPI NLSELKVDLL SMSSHKLYGP KGIGALYVCR KPRVRLEAII HGGGHERGMR SGTLPVHQIV GMGEAYRIAK EEMATEMPRL TALRDRLYNG LKDIEETYVN GSMEQRLGNN LNISFNYVEG ESLMMALRDI AVSSGSACTS ASLEPSYVLR ALGLNDELAH SSIRFTVGRY TTAEEIDYAI GLVKSAVEKL RDLSPLWDMF KEGIDLNSIE WTHH
|
| |