Gene MCA0212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0212 
SymbolnifS 
ID3103373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp214241 
End bp215389 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content68% 
IMG OID637169435 
Productcysteine desulfurase 
Protein accessionYP_112748 
Protein GI53802612 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.133802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGA CCTCGATCTA CCTGGACAAC AACGCCACGA CACGGCCGGC TCCGGAGTGC 
GTGGCGGCGA TGATGGCCTG CCTCCAGATG CATTATGGCA ACCCCTCCAG CAAGCATCGT
CTGGGCGAGG CCGCCAAGAT GGAGGTCATC GCCGCGCGGG CCAGGCTCGC CGCGCTGCTG
GGCGCCTCTC CGGCGGAAAT CGTTTTCACC AGCGGTGGTA CCGAATCCAT CCAGCAGGCC
ATCCGCGGTG CGCTGGCCTT GGCGGCGGAC AAGCGCCGGG TGGTGACCAG CGCCGTAGAG
CATCCGGCGA CCTTGCTTCT GCTGGAGCAT CTGGAAGCGC AGGGGCTCGA AGTGATCCGT
CTGCCGGTCG ACCGGCAGGG GCGGCTTGAT CTCGCCATGC TGGATGCCGC GCTCACTCCC
GATACCGGCT TGTTGAGCCT GATGTGGGCC AACAACGAGA CCGGCGTGCT GTTTCCCATC
GCGGAAGCGG CGGCGCTGGC CGCGAGCCGG GGGGTACTGT TCCATTGTGA TGCGGTCCAG
GCCGTAGGCA AGCTGCCCAT CGATTTGAGA CTGGTGCCGC TGGATTTCCT GTCCCTGTCT
GGACACAAGC TGCATGGCCC CAAAGGCATC GGCGCGCTGT TCGTGCGCAA GGGCCGCAAA
CTGCCGCCGC TGCTGTTGGG TCACCAAGAG CGCGGGCGCC GCGGCAGCAC CGAGAATGTG
GTGGGCATCG TGGGGCTGGG CGTGGCGGCG GAACTGGCGG CGGAACATTT GGCGAGCGGG
ATCGACGCCG TCGCCCGGCT GCGCGACCGC CTGGAAAGCC GGCTGCTCGC TGCGTTGCCG
GGGGCTTCGG TGAACGGCGC CGGTGCGCCT CGGGTGGCCG GGACGTCCAG CTTCAATCTG
GGGAATGTCG AAGCCGAGCT GGTGCTGGAC AAGCTGGACC GCGCCGGGGT CTGCGCCTCT
GCCGGAGCGG CCTGCAGCGC GGGTGGTACG GAGCCTTCCC ACGTGTTGAC GGCGATGGGG
CTGGGGAAGG AGGGAGCATT GGCCACCCTC CGTTTTTCAT TGAGCCGCTA CACCACCGTG
GCCGAGGTGG ATGCCGTGTG CGGCCTGTTG CCGGGGATCG TGCGCAGCCT GCTGGCCGAG
GCGGCGTGA
 
Protein sequence
MSETSIYLDN NATTRPAPEC VAAMMACLQM HYGNPSSKHR LGEAAKMEVI AARARLAALL 
GASPAEIVFT SGGTESIQQA IRGALALAAD KRRVVTSAVE HPATLLLLEH LEAQGLEVIR
LPVDRQGRLD LAMLDAALTP DTGLLSLMWA NNETGVLFPI AEAAALAASR GVLFHCDAVQ
AVGKLPIDLR LVPLDFLSLS GHKLHGPKGI GALFVRKGRK LPPLLLGHQE RGRRGSTENV
VGIVGLGVAA ELAAEHLASG IDAVARLRDR LESRLLAALP GASVNGAGAP RVAGTSSFNL
GNVEAELVLD KLDRAGVCAS AGAACSAGGT EPSHVLTAMG LGKEGALATL RFSLSRYTTV
AEVDAVCGLL PGIVRSLLAE AA