Gene MCA1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1504 
Symbol 
ID3102610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1602141 
End bp1603400 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content67% 
IMG OID637170679 
Productsulfite oxidase SoxC, putative 
Protein accessionYP_113961 
Protein GI53804403 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.545192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCTT TCATCGAACC GGAAACCCTT CCTTCCGCAA TCCCCAGTTC GGGGCCGGAC 
CGCCGGCGCT TCCTGAAAGC GGGGCTGGCC GTCACCGGCG CCGCCCTGAC CGGTGGCGTC
CGGGCGGCGC CGCCGCCTTG GATGACGCGG CCCGGCGCCC CGCTTTCCAA TTACGGCCAG
CCCTCGCCGC ACGAACGCGC CGTCATCCGC TGGGTCGCGG CCAATCCCGA TGCGCCGGGG
AACGGCATTT CCTGGACACC GCTGGAACGT CTGGAAGGCA TCATCACCCC CAGCGGCTTG
CACTTCGAGC GCCACCACAA CGGCGTGCCG CAAATCGATC CCGCCGTCCA TCGCCTGGTC
GTGCACGGAC TGGTTGTCAA GTCCTCGAGT TTCGGCATCG ACGACCTCCT GCGCTACCCG
CAGACCTCGC GCCAGTGTTT CGTCGAATGC GGTGGCAACG GCAATGCCGG CTGGCACCTG
GAGCCGATGC AAGCCCCGGC CGGTAACGTC CACGGCCTTG CTTCCTGCAG CGAATGGACT
GGCGTACCGC TGGCTACCGT GTTGGAGGAA TGTGGCCTGC AACCGAACGC CAAATGGCTG
ATCGCGGAAG GCGCCGATGC CGCGGCGATG AACGTCAGCA TTCCCCTGGA AAAGGCGCTG
GACGATGCCC TGCTCGCCCT GTACCAGAAC GGCGAGCGCC TGCGGCCGGA GAACGGTTAT
CCACTGCGGC TCATCCTGCC CGGCTGGGAA GGTGTCACCA ACGTCAAATG GTTGCACCGC
CTGCAGCTTG CGGAGCAGCC CGCGATGGCC CGTAACGAAA CCGCGAAATA CACCGAGCTG
CTGCCCTCCG GCCAGGCCCG GCAGTTCAGT TTCGTCATGG AGGCCAAGTC GCTCATCACT
CGTCCCTCCG CCGGCCAGTC CTTGCCCGGC CCCGGCTTGC ACCCGATCTC CGGGCTGGCC
TGGAGCGGCC GGGGCGCGAT CCGACGGGTG GAAGTTTCGG CCGATGGCGG CAAGACCTGG
CAGGACGCGG CGCTCGACCC GCCCGTGCTG CCCAAGTGCT TCACCCGCTT CCGCCTGCCC
TGGCGCTGGG ACGGCTCGCC TGCCGTACTC AAGAGCCGGG CCACCGACGA AACCGGCTAT
GTCCAGCCCG AACGCCAGAC CCTGATCGCC GAGCGCGGGC GCCACGGCTA CTTCCACTAC
AATGCGATCG TATCCTGGGC CGTCGCCGCC GATGGGAGCG TCAGCCATGT CTATGCGTGA
 
Protein sequence
MKPFIEPETL PSAIPSSGPD RRRFLKAGLA VTGAALTGGV RAAPPPWMTR PGAPLSNYGQ 
PSPHERAVIR WVAANPDAPG NGISWTPLER LEGIITPSGL HFERHHNGVP QIDPAVHRLV
VHGLVVKSSS FGIDDLLRYP QTSRQCFVEC GGNGNAGWHL EPMQAPAGNV HGLASCSEWT
GVPLATVLEE CGLQPNAKWL IAEGADAAAM NVSIPLEKAL DDALLALYQN GERLRPENGY
PLRLILPGWE GVTNVKWLHR LQLAEQPAMA RNETAKYTEL LPSGQARQFS FVMEAKSLIT
RPSAGQSLPG PGLHPISGLA WSGRGAIRRV EVSADGGKTW QDAALDPPVL PKCFTRFRLP
WRWDGSPAVL KSRATDETGY VQPERQTLIA ERGRHGYFHY NAIVSWAVAA DGSVSHVYA