Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1632 |
Symbol | nifS |
ID | 5712777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1696137 |
End bp | 1697183 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641267548 |
Product | cysteine desulfurase |
Protein accession | YP_001532975 |
Protein GI | 159044181 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.422734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000261738 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAGTCAC GCGTCTATCT CGATCACAAT GCGACCGCCC CGCTGCGCAC CGAGGCGCGC ACGGCGATAT GCGCGGCCAT GGACCTTGTG GGAAACCCAT CTTCGGTGCA CGGCGAGGGG CGCGCGGCAA AGGCATTGAT CGAACGGGCG CGGGCGGAGA TCCTGGAGGC ATTGGGTGCG CAGGATTGCG ACCTGGTGTT CACGTCCGGC GCGACGGAGG CCGCAGCCCT CGCCTGCGCC GGGCAGGACC TCGACTGCGC CCCGATCGAG CATGATGCGG TGGCGTTTTG GGCGAACCCC GGTCTCGGGG TTGATGCGGA TGGTACGGTC AAGGTCGAAA CCGCGGGAAG AACGGCGCTG CAGATGGCAA ATTCCGAGAC CGGGATCGTG CAAGACCTGC CCCAGGGGCT CGCGGTTAGC GATCTGACGC AGGCGTTCGG CAAGCTTCCG GTCGCGTTTT CTTGGACGGG AGCACGCATC GGCTTCGTTT CCGCCCACAA GCTGGGCGGC CCCAAGGGAA TCGGCGCTGT CGTGATGCGG CGCGGGGTCG ATCTTTTGGC ACAAATCCGC GGTGGCGGGC AGGAAATGGG CTGGCGTTCT GGCACCGAAA ACCTGATCGG TATCGCTGGA TTTGCGGCTG CAGCCAAGTC CGCGCAGGCA GATCTGGACA AAGGTGTCTG GGCGCGAGTT TCTGAAATTA GAAATATTCT AGAAACAGCC CTTCACTCTG CCAGCTCACA GACTATTTTT GTCGGGAAAG ATGTGCAACG TCTGCCGAAC ACGATCTGTG CTGTGACACC CGGCTGGCGA GGCGAGACGC AAGTGATGCA GATGGATCTC GCCGGCTTCG CGGTGAGCGC AGGCTCCGCC TGTTCCAGCG GGAAGGTGAG GCCCAGCCGC GTGCTGCAGG CCATGGGCTT CAGCCCCGAG GACGCAGCCT GTGCTCTCCG GGTCTCCATC GGCCCGGAAA CGAAAGAAGA GGACGCCCTG CGCTTCGCTG AAGCTTGGGG GAAAGCAAAG AACCGACATG CGGCCCGGGC CGCCTGA
|
Protein sequence | MKSRVYLDHN ATAPLRTEAR TAICAAMDLV GNPSSVHGEG RAAKALIERA RAEILEALGA QDCDLVFTSG ATEAAALACA GQDLDCAPIE HDAVAFWANP GLGVDADGTV KVETAGRTAL QMANSETGIV QDLPQGLAVS DLTQAFGKLP VAFSWTGARI GFVSAHKLGG PKGIGAVVMR RGVDLLAQIR GGGQEMGWRS GTENLIGIAG FAAAAKSAQA DLDKGVWARV SEIRNILETA LHSASSQTIF VGKDVQRLPN TICAVTPGWR GETQVMQMDL AGFAVSAGSA CSSGKVRPSR VLQAMGFSPE DAACALRVSI GPETKEEDAL RFAEAWGKAK NRHAARAA
|
| |