Gene Msil_3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3620 
Symbol 
ID7092893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3985495 
End bp3986697 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content64% 
IMG OID643466908 
Productcysteine desulfurase NifS 
Protein accessionYP_002363867 
Protein GI217979720 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.190149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCGA TCTATCTCGA CAACAACGCC ACGACCCGGG TCGATCCCGC CGTCCTCGAG 
TCGATGCTGC CGTTTTTTAC CGAGCAATTC GGCAACGCCT CGTCGACGCA TTCCTTCGGA
GAGAGCGTCG CCGGCGCGAT CAAGACCGCG CGCAAGCAAT TGCAGGAGCT CCTCGGAGCC
GCCCATGATC ACGAGATCAT CTACACCTCG GGCGGAACCG AAAGCAACAC GACCGCGATC
ATGGCGGCGC TCGAAGCCAT GCCCGGCCGC GATGAAATCA TCACGAGCGC CGTCGAGCAT
CCCTCCGTGC TGCAGCTTTG CGAGCATCTG GAGAAGACCG GCCGCGCGAA AGTTCACGTC
ATCGGCGTCG ATTCTTACGG GCGGCTCGAT CTTGACGGCT ACCGCGCCGC TCTGTCGGAC
AAGGTCGCAC TCGTCTCCAT CATGTGGGCG AACAATGAGA CCGGAACGAT CGTTCCCGTC
GATGCGCTCG CCGATCTCGC CAAGGCGGCC GGCGCCCTGT TTCATACCGA CGCCGTGCAG
GCGATCGGCA AGCTTCCGGT CGCGCTGAAA GATACGGCGA TCGACATGCT GTCCCTCTCC
GGCCACAAGC TGCATGCGCC GAAGGGAATC GGCGCGCTTT ATGTGAAGCG CGGCGTTCGG
TTCAAGCCGC TGATCCGCGG CGGCCATCAG GAGCGCGGAC GGCGCGCCGG AACGGAGAAT
GTTCCCGCAA TCGTCGGCCT CGGCAAGGCG GCTGAGCTCG CCATCCAGAA TTTCGCCGAC
GAGCAGGGGC GCGTCAGGGC TTTGCGCGAT CGGCTCGAGC AGGCGATCTT GGCCAGCGTG
GCGAATTGCG CGGTCAACGG CGATCTGCAT GATCGGCTGC CCAACACGTC CAATATCGCT
TTCGACTACA TCGAGGGCGA GGCCATTTTG TTGCATCTGA CGCGAGCCGG CATTGCCGCC
TCGACCGGAT CGGCCTGCAC CGCGGGATCG ACCGAGCCAA GCCATGTTCT GCGGGCGATG
AATGTCCCGG AAGCGGCGCT CCACGGCGCG ATCCGTTTTT CCCTATCGCG CGACAACAAC
GCGGCGGACG TCGATCGGGT GATCGAAGCG CTGCCGCCGA TCGTCGACAA GCTCCGCGCG
CTGTCGCCGT TCTGGTCCGA TGGAAAGTCG CCCTCCGGCG ACGCCACGCC ACTTTACGCA
TGA
 
Protein sequence
MKPIYLDNNA TTRVDPAVLE SMLPFFTEQF GNASSTHSFG ESVAGAIKTA RKQLQELLGA 
AHDHEIIYTS GGTESNTTAI MAALEAMPGR DEIITSAVEH PSVLQLCEHL EKTGRAKVHV
IGVDSYGRLD LDGYRAALSD KVALVSIMWA NNETGTIVPV DALADLAKAA GALFHTDAVQ
AIGKLPVALK DTAIDMLSLS GHKLHAPKGI GALYVKRGVR FKPLIRGGHQ ERGRRAGTEN
VPAIVGLGKA AELAIQNFAD EQGRVRALRD RLEQAILASV ANCAVNGDLH DRLPNTSNIA
FDYIEGEAIL LHLTRAGIAA STGSACTAGS TEPSHVLRAM NVPEAALHGA IRFSLSRDNN
AADVDRVIEA LPPIVDKLRA LSPFWSDGKS PSGDATPLYA