Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0437 |
Symbol | |
ID | 6262573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 471461 |
End bp | 472621 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642610907 |
Product | cysteine desulfurase NifS |
Protein accession | YP_001875331 |
Protein GI | 187250849 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.361233 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTAA TTTATCTTGA CAATAACGCA ACAACGCGTA CAGCACCTGA AGTTGTTAAG GAAATGCTTC CTTATTTTTC CGAACATTAC GGCAATGCTT CAAGCATGCA TACTTTTGGC GGGGAAAATA AAAAAGTTAT CGAGGACGCC AGAAAAAAAA TGGCCGCCCT TATAGGCGCC CAATATCCGG ACGAAATTAT TATTACCGCG GGCGGCACGG AAGCGGACAA TACGGCAATA ATGTCTGCCA TAAATTCTTT TCCCGATAAA AAACATATTA TTACCTCAGC TGTGGAACAC CCGGCCGTTT TGGAAGTTTT TAAAAACCTG CAGGCCAAAG GATATAAGGT TGATTATATA GGCGTTGATA AAAACGGCAG GTTTAATATG GACGAATTTA AAGCAGCCGT TAATGAAAAC ACGGCCTTAG TTTCCATAAT GTGGGCCAAC AGCGAAACGG GCACAATTTT CCCTATAGAA GAAATAGCAA AAATAACCAA AGAGGCGGGC AGCGTTTTTC ATACAGACGC TGTGCAAGCT GTCGGTAAAA TACCCGTTAA CGTGGCGGAT ACGGATATAA ACATGCTTTC TTTTTCGGCT CACAAGTTTC ACGGGCCTAA AGGTATAGGC GCTTTATATG TAAAACGCAG AACACGCTTT ATGCCTTTTA TAATAGGCGG ACACCAGGAA AAAGGGCACA GGGCAGGCAC GGAAAATGTG CCCGCTATAG CGGGTTTCGG CAAAGCGTGT GAAATGGCGT TGGAGAATTT AAAAAACACT TCTAAAACAG CCGTTTTAAG GGACAGGCTT GAAAAGGGTC TCCTTGCAAA AATTTCTCAT TCAAAAGTTA ACGGTGATGT TGAAAACAGG CTTCCTAATA CGTCAAATAT AAGTTTTGGC TATATTGAAG GGGAATCAAT ACTTTTACAT TTAAACGATT ACGGCATTTG CGCTTCTTCA GGTTCGGCCT GCACGTCCGG AAGTTTGGAG CCGAGCCACG TTTTAAGAGC AATGTGCGTT GATTTTAATT TTGCGCACGG TTCGGTAAGG TTTTCTTTAA GCGATGAAAA TACAGAACAG GAAATTGATT TTGTTATAGA AAAACTGCCG CCCATAATCG AGACGCTTCG CCAAATATCA CCTTTCGGCC GCCGGAGCTA G
|
Protein sequence | MKVIYLDNNA TTRTAPEVVK EMLPYFSEHY GNASSMHTFG GENKKVIEDA RKKMAALIGA QYPDEIIITA GGTEADNTAI MSAINSFPDK KHIITSAVEH PAVLEVFKNL QAKGYKVDYI GVDKNGRFNM DEFKAAVNEN TALVSIMWAN SETGTIFPIE EIAKITKEAG SVFHTDAVQA VGKIPVNVAD TDINMLSFSA HKFHGPKGIG ALYVKRRTRF MPFIIGGHQE KGHRAGTENV PAIAGFGKAC EMALENLKNT SKTAVLRDRL EKGLLAKISH SKVNGDVENR LPNTSNISFG YIEGESILLH LNDYGICASS GSACTSGSLE PSHVLRAMCV DFNFAHGSVR FSLSDENTEQ EIDFVIEKLP PIIETLRQIS PFGRRS
|
| |