Gene Information |
Locus tag | Hoch_2190 |
Symbol | |
ID | 8544576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 3051381 |
End bp | 3052601 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646386897 |
Product | Cysteine desulfurase |
Protein accession | YP_003266628 |
Protein GI | 262195419 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
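The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3051381, 3052601)
residues = protein_length(length)
print(length, residues)  # 1221 406
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.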
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.980213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0116553 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATCG ACCTGCCCAT CTACCTCGAC AACCACGCCA CGACGCCCTG CGATCCGCGC GTGCTCGAGG TCATGCTGCC CTACTTCACC GAGCACTGCG GCAACGCCGC CAGCCGCAGC CATGTCTTCG GCTGGACGGC CGAAGCCGCC GTGGACTACG CGCGCGAACA AGTCGCCACG CTGATCGGGG CCTCGCCGCG CGAGATCGTG TTCACCTCGG GCGCCACCGA GGCGGACAAC CTGGCCATCA AGGGCGTGGC CGCGACCCGG CACACCCAGG GCAAGCACAT CATCACCGCC GTGACCGAGC ACAAAGCCGT GCTCGACTCG TGCGCGCGTC TGGCCCGCGA GGGCTTCGAG ATCACGTATC TCACGCCCGG CCCCGACGGT CTGCTCACGC CGGCCCAGGT CGCCGAGGCG GTGCGCGACG ACACCATCTT GGTGAGCGTC ATGTTCGTCA ACAACGAGAT CGGCGTGGTG CAGCCCATCG CCGAGATCGC GGCGGCGGTC AAAGCCGCCA ACGAGCGCGC GCTGGTTCTG TGCGACGCGG TCCAGGGCGT CGGCAAGCTG CCCTTTGCTG TCGAGGACAT GGGCGTGGAT CTCGTCGCGC TGTCCGCCCA TAAAATGTAC GGCCCCAAAG GCGTGGGCGC GCTGTGGGTG CGACGCCGGC CGCGCGTGCG CCTCGAGCCG CTGATCGACG GCGGCGGTCA CGAGCGCGGC CTGCGCTCGG GCACCCTGCC GGTGCCGCTG ATCATCGGTT TCGGCATGGC CTGCGAGCTG TCTCAAGAGA GCATGGACGA GGAGGCCGCG CGCACCGCCG GGCTGCGCGA TCGCCTGCTG CACGGGCTGC GAACGCGCCT CGATGGGGTC TCGGTCAATG GCTCGCTGGA GCACCGCGTG CCCGGCAACC TCAACCTCTC CTTTGCCGGC GTGGACGGCG AATCACTGCT CATGTCGCTC AAAGACGTGG CGGTGTCCTC GGGCTCCGCG TGCACCTCGG CGACGCAGGC GCCGAGCTAC GTGCTGCGCG CGCTGGGCGT GGACGATGAA TTGGCGCAGG CGTCGCTGCG CTTTGGCGTC GGTCGCTTCA ACACCGAAGC GCAGATCGAC TACGTCATCG AGTTGTGCGC GGACGCGGTC GAGCGCCTGC GCGCGCTCGG ACCTTCCCCC GCCGATATCG CCGGCGATGC GCCGGTGTGT GAGCGGCCTG GAGACGATTA G
|
Protein sequence | MSIDLPIYLD NHATTPCDPR VLEVMLPYFT EHCGNAASRS HVFGWTAEAA VDYAREQVAT LIGASPREIV FTSGATEADN LAIKGVAATR HTQGKHIITA VTEHKAVLDS CARLAREGFE ITYLTPGPDG LLTPAQVAEA VRDDTILVSV MFVNNEIGVV QPIAEIAAAV KAANERALVL CDAVQGVGKL PFAVEDMGVD LVALSAHKMY GPKGVGALWV RRRPRVRLEP LIDGGGHERG LRSGTLPVPL IIGFGMACEL SQESMDEEAA RTAGLRDRLL HGLRTRLDGV SVNGSLEHRV PGNLNLSFAG VDGESLLMSL KDVAVSSGSA CTSATQAPSY VLRALGVDDE LAQASLRFGV GRFNTEAQID YVIELCADAV ERLRALGPSP ADIAGDAPVC ERPGDD
|
| |
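The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to remove that grouping and compute a GC fraction that could be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungrouped sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence listed above (illustration only).
example = "ATGTCGATCG ACCTGCCCAT"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

To check the whole gene, pass the full Gene sequence value to `clean_sequence` in place of `example`.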