Gene MCA0247 details


Gene Information

Locus tag: MCA0247
Symbol: iscS
ID: 3103376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 241910
End bp: 243121
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 63%
IMG OID: 637169468
Product: cysteine desulfurase
Protein accession: YP_112781
Protein GI: 53802543
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR02006] cysteine desulfurase IscS; [TIGR03402] cysteine desulfurase NifS
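
The length fields above are consistent with the listed coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length (the function names are hypothetical and not part of the source database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(241910, 243121)
print(length)                  # 1212
print(protein_length(length))  # 403
```

Both values agree with the Gene Length and Protein Length fields in the record.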


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTAC CGGTCTATCT CGATTACTCG GCTACGACGC CGGTCGACCC ACGTGTTGCC 
GAGAAGATGA TTCCATTCCT GACGGAAAAC TTCGGCAACC CTGCCAGCCG CTCCCATACC
TTCGGCTGGA CCGCCGAGCA GGCGGTGGAG AACGCGCGGG AGGAAGTGGC GAAGCTGGTC
AACGCCGATC CCCGGGAAAT CGTCTGGACC TCCGGCGCGA CCGAGTCGGA CAACCTCGCC
ATCAAGGGTG CGGCCGAGTT CTACCAGACC AAAGGCCGTC ATCTGATCAC CGTGAAGACC
GAACACAAGG CGGTGCTCGA TACCATGCGT GAGCTGGAGT CGTCCGGTTT CGAGGTTACC
TATCTCGAGC CGATGGCCAA CGGTCTCCTG GATCTGGACG CCTTCAGGGC GGCGATCCGG
CCCGACACGG TGCTGGCTTC GGTAATGCAG GTCAACAACG AGATAGGGGT GATTCAGGAC
ATCGCGGCGA TCGGAGGGAT CTGCCGGGAA CACGGCGTGA TCTTCCACGT CGATGCCGCC
CAGGCCACTG GCAAGGTCGA GATCGATCTG GAGCAGCTGC CGGTCGATCT CATGTCTTTC
TCCGCCCACA AGACTTATGG TCCCAAGGGC ATCGGCGCCC TCTACGTGCG CCGCAAGCCG
CGCATCCGGC TCAAGGCACA GATGCACGGC GGTGGCCATG AGCGGGGTTT GCGCTCCGGC
ACCCTCGCCA CGCACCAGAT CGTCGGCATG GGGGAAGCCT TCCGCATCGC CCGCGAGGAG
ATGGCGGCGG AGAACGAACG CATCCGTCGG CTGCGTGACA GGCTGCTGGC CGGTCTCGCG
GACATGGAAG AGGTGTTCAT CAACGGCGAT CTGGAACAGC GGGTGCCGCA CAACCTGAAC
ATCAGCTTCA ACTACGTGGA GGGTGAGTCC TTGATGATGG CCATCAAGGA CCTGGCGGTG
TCCAGTGGTT CGGCCTGTAC CTCGGCCAGC CTCGAGCCTT CCTATGTGCT GCGTGCTTTG
GGCCGCAGTG ATGAGCTGGC CCACAGCTCG ATCCGCTTCA CTCTGGGCCG CTATACCACG
GCGGAGGACG TGGATTTCGC CATCTCCCTG ATCAAGGACA AGGTCGCTCG CCTGCGCGAG
ATCTCGCCCC TGTGGGAGAT GTACAAGGAC GGGATCGACC TCAACACCGT TCAGTGGGCC
GCCGCGCATT AG
 
Protein sequence
MKLPVYLDYS ATTPVDPRVA EKMIPFLTEN FGNPASRSHT FGWTAEQAVE NAREEVAKLV 
NADPREIVWT SGATESDNLA IKGAAEFYQT KGRHLITVKT EHKAVLDTMR ELESSGFEVT
YLEPMANGLL DLDAFRAAIR PDTVLASVMQ VNNEIGVIQD IAAIGGICRE HGVIFHVDAA
QATGKVEIDL EQLPVDLMSF SAHKTYGPKG IGALYVRRKP RIRLKAQMHG GGHERGLRSG
TLATHQIVGM GEAFRIAREE MAAENERIRR LRDRLLAGLA DMEEVFINGD LEQRVPHNLN
ISFNYVEGES LMMAIKDLAV SSGSACTSAS LEPSYVLRAL GRSDELAHSS IRFTLGRYTT
AEDVDFAISL IKDKVARLRE ISPLWEMYKD GIDLNTVQWA AAH
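
The relationship between the gene sequence and the protein sequence can be checked with a short script. The following is a minimal sketch, not the database's own tooling: it assumes the amino acid assignments of the standard genetic code (which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. It strips the 10-base spacing used in the listing before processing.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as listed above.
first_line = "ATGAAGCTAC CGGTCTATCT CGATTACTCG GCTACGACGC CGGTCGACCC ACGTGTTGCC"
print(translate(first_line))  # MKLPVYLDYSATTPVDPRVA
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the listed protein sequence and the GC content field.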