Gene Mmc1_1905 details

Gene Information

Locus tag: Mmc1_1905
Symbol:
ID: 4481127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Magnetococcus sp. MC-1
Kingdom: Bacteria
Replicon accession: NC_008576
Strand:
Start bp: 2370123
End bp: 2371820
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 52%
IMG OID: 639722650
Product: sulfate thiol esterase SoxB
Protein accession: YP_865819
Protein GI: 117925202
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
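
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the coordinates are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The function names span_length and coding_codons are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2370123
END_BP = 2371820
GENE_LENGTH_BP = 1698
PROTEIN_LENGTH_AA = 565


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2371820 - 2370123 + 1 = 1698 bp, and 1698 / 3 - 1 = 565 aa.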


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0244969
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTGT CACGTCGCGA ATTTATGGAG CTGGGTGCAT TGACCGCTGC GACTGTCGTC 
GCAGGGGCTA GTTCGTTGGG GTATGCCTCG AATCGGGCGG GGTTAGAGAG CCTGTTACGT
TTTAAGCCTG AAGGCAAACT AACCTTGATG CACATTGCAG ACTGTCATGC TCAACTGCTG
CCTATCTATT TTCGTGAGCC CAGTGTCAAT ATTGGGATAG GGGGGGCAAA AGGGCATCCT
CCCCATCTGG TCGGGCATGA TTTTCTGAAT TATTTTGGCT TTGAAAGCGG TGGTATCGAG
GCTTACGCCT ATACCATGCT AGACTATATA AAGCTGGCCC ATGAGCTGGG TCCTGTAGGT
GGTTTTGCTC ACTTGGCCAC CTTGGTGAAG GCCATTCGGC AGGAGCGCGG GGCAGATAAC
TGCATTCTGC TGGATAGCGG TGATACATGG CAGGGCAGCT ATTCCGCCAT GATGACTAAA
GGGGCCGACA TGATCGAGGC CTGTAACCTG TTGGGGGTCG AGGGCATGAC CCCGCACTGG
GAGTTTACCT ATGGTGCTGA GCAGGTAAAA GCCAATATCG CCAAGCTTAA TTTTCCCTTT
TTAGCCCACA ATGTGGTGGA TAGCGAGTGG GAAGAGAAGG TTTTTGAACC CTATCATATC
TATGAAAAGG CTGGAACCAA GGTTGCGGTT ATTGGTCAGG CTTTTCCATA TACCCCCATC
GCCAACCCTC GCCGCATGTT CCCCAAGTGG TCCATGGGCA TTCAGGAGCA GGGTGTGCAA
GAGCAGGTCA ACGCCGCGCG GGAAGAGGGT GCCCAGGTGG TGGTATTGCT CTCGCATAAT
GGTATGGATG TTGATCTGAA ACTAGCCAGC CGCGTGACGG GTATTGATGT GATCTTAGGG
GGGCATACCC ATGATGCCAT TCCCAAACCC TCCCAAGTGA AAAATGCCGA GGGAACCACC
TTGGTGTGTA ACTCGGGTTC CAATGGTAAG TACCTCTCCC GTATGGATCT GGATGTTGCC
GATGGCCGCC TTAAAGGGTG GAACTACCGA TTGATTCCCG TGGTTTCTAA TCTTATTCCT
GCGGATGCTG AAATGGCGGC CCTGATTGAA CGGGTGCGCC AGCCTTATCT ACAGGAGATC
AATACGGTGG TGGGTAAGAC CGATTCGCTG CTCTATCGCC GGGGTAATTT TAATGGTACT
TACGACGATC TGATCTGTCA GGCGCTTAAT AGTCAGTTGG AGAGTGAGGT TTCGCTCTCG
CCTGGTTTCC GCTGGGGGGC ATCGCTGCTA CCTGGGCAGG ATATTACCAT GGATAGCATC
TATACCCAGA CCGCCATTAC CTACCCCAAC ACCTATCGGC GGCAGATGAG TGGTGAACAG
ATCAAGACCA TTCTTGAGGA TGTGGCGGAT AACCTGTTTA ATCCCGACCC TTACCGTCAG
CAGGGTGGAG ATATGGTGCG GGTTGGGGGC TTGCGTTACC GCATTAAACT GGGTGAAACC
ATCGGTAAAC GACTGCATGA TATCGAGATT GGTGGCAAAA AGATGGAGGC AGCTAAAGCG
TACTGGGTGA GTGGCTGGGC CTCCATGGGT GAGGTGGATG GTCCGCCAAT CTGGGATGTG
GTGCGTAAAC ATGTGGAAGA TAAAAAAGTG ATCTCCATTG AACCAGACAC CACCACCATG
GGTGTGACAG AGAGCTAA
 
Protein sequence
MSLSRREFME LGALTAATVV AGASSLGYAS NRAGLESLLR FKPEGKLTLM HIADCHAQLL 
PIYFREPSVN IGIGGAKGHP PHLVGHDFLN YFGFESGGIE AYAYTMLDYI KLAHELGPVG
GFAHLATLVK AIRQERGADN CILLDSGDTW QGSYSAMMTK GADMIEACNL LGVEGMTPHW
EFTYGAEQVK ANIAKLNFPF LAHNVVDSEW EEKVFEPYHI YEKAGTKVAV IGQAFPYTPI
ANPRRMFPKW SMGIQEQGVQ EQVNAAREEG AQVVVLLSHN GMDVDLKLAS RVTGIDVILG
GHTHDAIPKP SQVKNAEGTT LVCNSGSNGK YLSRMDLDVA DGRLKGWNYR LIPVVSNLIP
ADAEMAALIE RVRQPYLQEI NTVVGKTDSL LYRRGNFNGT YDDLICQALN SQLESEVSLS
PGFRWGASLL PGQDITMDSI YTQTAITYPN TYRRQMSGEQ IKTILEDVAD NLFNPDPYRQ
QGGDMVRVGG LRYRIKLGET IGKRLHDIEI GGKKMEAAKA YWVSGWASMG EVDGPPIWDV
VRKHVEDKKV ISIEPDTTTM GVTES
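
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is a minimal example and not the tool that produced the record. It assumes the usual NCBI codon ordering (bases T, C, A, G) and uses the amino acid assignments that translation table 11 shares with the standard code. Table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG. The names clean, translate, and gc_percent are hypothetical helpers.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G); amino acids as in table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence above.
    print(translate("ATGAGCTTGT CACGTCGCGA ATTTATGGAG"))  # MSLSRREFME
    print(translate("GGTGTGACAG AGAGCTAA"))               # GVTES
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to translate and gc_percent would allow the Protein Length and GC content fields to be checked in the same way.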