Gene Emin_0437 details


Gene Information

Locus tag: Emin_0437
Symbol:
ID: 6262573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 471461
End bp: 472621
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 43%
IMG OID: 642610907
Product: cysteine desulfurase NifS
Protein accession: YP_001875331
Protein GI: 187250849
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR03402] cysteine desulfurase NifS
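
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 471461
end_bp = 472621
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and adds no residue.
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # coordinates match gene length
print(remainder == 0)                             # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)   # codons minus stop match protein length
```

Under these assumptions all three checks agree with the record: 472621 - 471461 + 1 = 1161 bp, and 1161 / 3 = 387 codons, which is 386 residues plus one stop codon.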


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.361233
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTAA TTTATCTTGA CAATAACGCA ACAACGCGTA CAGCACCTGA AGTTGTTAAG 
GAAATGCTTC CTTATTTTTC CGAACATTAC GGCAATGCTT CAAGCATGCA TACTTTTGGC
GGGGAAAATA AAAAAGTTAT CGAGGACGCC AGAAAAAAAA TGGCCGCCCT TATAGGCGCC
CAATATCCGG ACGAAATTAT TATTACCGCG GGCGGCACGG AAGCGGACAA TACGGCAATA
ATGTCTGCCA TAAATTCTTT TCCCGATAAA AAACATATTA TTACCTCAGC TGTGGAACAC
CCGGCCGTTT TGGAAGTTTT TAAAAACCTG CAGGCCAAAG GATATAAGGT TGATTATATA
GGCGTTGATA AAAACGGCAG GTTTAATATG GACGAATTTA AAGCAGCCGT TAATGAAAAC
ACGGCCTTAG TTTCCATAAT GTGGGCCAAC AGCGAAACGG GCACAATTTT CCCTATAGAA
GAAATAGCAA AAATAACCAA AGAGGCGGGC AGCGTTTTTC ATACAGACGC TGTGCAAGCT
GTCGGTAAAA TACCCGTTAA CGTGGCGGAT ACGGATATAA ACATGCTTTC TTTTTCGGCT
CACAAGTTTC ACGGGCCTAA AGGTATAGGC GCTTTATATG TAAAACGCAG AACACGCTTT
ATGCCTTTTA TAATAGGCGG ACACCAGGAA AAAGGGCACA GGGCAGGCAC GGAAAATGTG
CCCGCTATAG CGGGTTTCGG CAAAGCGTGT GAAATGGCGT TGGAGAATTT AAAAAACACT
TCTAAAACAG CCGTTTTAAG GGACAGGCTT GAAAAGGGTC TCCTTGCAAA AATTTCTCAT
TCAAAAGTTA ACGGTGATGT TGAAAACAGG CTTCCTAATA CGTCAAATAT AAGTTTTGGC
TATATTGAAG GGGAATCAAT ACTTTTACAT TTAAACGATT ACGGCATTTG CGCTTCTTCA
GGTTCGGCCT GCACGTCCGG AAGTTTGGAG CCGAGCCACG TTTTAAGAGC AATGTGCGTT
GATTTTAATT TTGCGCACGG TTCGGTAAGG TTTTCTTTAA GCGATGAAAA TACAGAACAG
GAAATTGATT TTGTTATAGA AAAACTGCCG CCCATAATCG AGACGCTTCG CCAAATATCA
CCTTTCGGCC GCCGGAGCTA G
 
Protein sequence
MKVIYLDNNA TTRTAPEVVK EMLPYFSEHY GNASSMHTFG GENKKVIEDA RKKMAALIGA 
QYPDEIIITA GGTEADNTAI MSAINSFPDK KHIITSAVEH PAVLEVFKNL QAKGYKVDYI
GVDKNGRFNM DEFKAAVNEN TALVSIMWAN SETGTIFPIE EIAKITKEAG SVFHTDAVQA
VGKIPVNVAD TDINMLSFSA HKFHGPKGIG ALYVKRRTRF MPFIIGGHQE KGHRAGTENV
PAIAGFGKAC EMALENLKNT SKTAVLRDRL EKGLLAKISH SKVNGDVENR LPNTSNISFG
YIEGESILLH LNDYGICASS GSACTSGSLE PSHVLRAMCV DFNFAHGSVR FSLSDENTEQ
EIDFVIEKLP PIIETLRQIS PFGRRS
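
The gene and protein sequences above can be checked against each other by translating the gene sequence codon by codon. The sketch below is illustrative only: the helper names `clean` and `translate` are hypothetical, and it relies on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). To stay short, it translates only the first 60 and the last 21 nucleotides of the gene sequence rather than all 1161.

```python
# Codon table laid out with bases ordered T, C, A, G at each codon position.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 and last 21 nucleotides, copied from the gene sequence above.
first_60 = "ATGAAGGTAA TTTATCTTGA CAATAACGCA ACAACGCGTA CAGCACCTGA AGTTGTTAAG"
last_21 = "CCTTTCGGCC GCCGGAGCTA G"

print(translate(first_60))  # MKVIYLDNNATTRTAPEVVK
print(translate(last_21))   # PFGRRS*
```

The two fragments translate to the first 20 and the last 6 residues of the protein sequence listed above, followed by the stop codon TAG. Passing the full gene sequence to `translate` extends the same check to the whole protein.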