Gene MCA2884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2884 
Symbol 
ID3103932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp3076014 
End bp3077135 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content67% 
IMG OID637172012 
Productcysteine desulfurase 
Protein accessionYP_115277 
Protein GI53803004 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTACT TCGATCACAA CGCCACCACC CCGCTGGACG GGCGGGTGCT GGAGGCCATG 
CTGCCTTATC TGAAATCCTG CCACGGCAAT CCCTCCAGCC TGCACCGCCC CGGTCGTATC
GCCCGCGACG CCGTGGAAAC CGCCCGCGCC CAGGTTGCCG CGCTGGTCGG CGCGACAGCC
AGCCAGGTCG TGTTCACCAG CGGCGGCAGC GAAGCCAACA ACCTCGCCCT CAAAGGACTG
GCCTGGAGTC TGAAACCCGG TCACATCCGC ATCGGCGCCA CCGAACACCC CTCTGTCGTC
GAATCCGCGC GGCTCCTCGC CGGGCATGGC TGGGACTGTC GAACCCTGAC GGTGGACGCG
CGGGGGCTGA TCGAAGATGC CGCCCTCGAC GCGATTGCGA AAAACCCGCC CGACATCGTC
TCGGTCATGC TGGCCAACAA CGAAACCGGC GTGATCCAGG ATGTCGCCCG CATCGCCGCT
TTGGCCACGG GCGCCTGGCT CCATTGCGAT GCGGTCCAGG CAGCAGGAAA GATCCCGCTG
AACTTTGCCC GGACCGGCGT CCACCTGATG TCCCTGTCGG GACACAAGAT CGGCGGCCCC
AAAGGAGCCG GCGCCCTGAT TGCCGACGCT TCGGTCCCCT TGACGCCGCT CATCCATGGC
GGCGGACAGG AAAAGGGGCT GCGTGGCGGC ACCGAGAACG TGGCCGCCAT CGTCGGTTTC
GGCAAAGCCG CAGAGCTTGC CGCATCGGAA CTGCAACAGC GCAGCCGGTG GCTTCGCCGG
CTACGGGATC GCCTGGAGCA GGGCATCGAA AAACTCCCCG GCGCAACGGT CTTCGCCCGT
ACCGCGGAAC GGCTGCCCAA TACCCTTCAG TTCGCCGTGG CGGGCTATGA CGGCGAGACC
CTCGTCATGC TGCTGGACCG GCACGGCATC GCGGTTTCCA GCGGTTCAGC CTGCGCCGGC
GGCGCACGCG AACCCAGCCC GGTGCTGCTC GCCATGGGAG TCGATCCGGC TCTGGCGACC
GGCGCGGTAC GGATCAGTCT CGGCAAGGAC AACACCGAAG CGGAGGTGGA ACGACTGTTG
ACCGCGCTCG GTCGAATCCT CGAAACAGGG CAGACCTATT GA
 
Protein sequence
MIYFDHNATT PLDGRVLEAM LPYLKSCHGN PSSLHRPGRI ARDAVETARA QVAALVGATA 
SQVVFTSGGS EANNLALKGL AWSLKPGHIR IGATEHPSVV ESARLLAGHG WDCRTLTVDA
RGLIEDAALD AIAKNPPDIV SVMLANNETG VIQDVARIAA LATGAWLHCD AVQAAGKIPL
NFARTGVHLM SLSGHKIGGP KGAGALIADA SVPLTPLIHG GGQEKGLRGG TENVAAIVGF
GKAAELAASE LQQRSRWLRR LRDRLEQGIE KLPGATVFAR TAERLPNTLQ FAVAGYDGET
LVMLLDRHGI AVSSGSACAG GAREPSPVLL AMGVDPALAT GAVRISLGKD NTEAEVERLL
TALGRILETG QTY