Gene Mmar10_0597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0597 
Symbol 
ID4286885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp690291 
End bp692405 
Gene Length2115 bp 
Protein Length704 aa 
Translation table11 
GC content66% 
IMG OID638140062 
Productcytochrome c biogenesis protein, transmembrane region 
Protein accessionYP_755828 
Protein GI114569148 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein
[COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00681901 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTTTTC TGAGTGGGCT TTCGGCCCTG ATCGCTCTGG CCGGATTGGT GGCGACGGCC 
CCGGCTTCCG CGCAATCCTC GTGGTCGAGT GGCGAAGCCA TCATCGAAGC CGATCTGGTC
TCCGATCGTT CGATTGTGGC ACCGGGTGAC AGCTTTCACA TCGGTCTGCA CCAGATCATG
CCGGAGGGGT GGCACACCTA TTGGCGCAAT CCGGGCGATA ACGGCCTGCC GGTCGAGATC
GACTGGGACC TGCCGTCCGG CGTCGAGATC GGCGAGATTG TCTGGCCTGT CCCGATCGAA
CTGCCGCTGA CCGACACGAT CATGGACTAT GGCTATAAGG GCGAGTTGGT CCTGCCGATG
CCGGTAACGG TGGCCAGCGA TTTTGCCGGC GAGGCGATCG AGTTTCGGGC CAATGCGACG
TGGCTGGTCT GTGACACGAT CTGTGTGCCA GAAGACCGCG AGCTTACCCT GACCCTGCCG
GTCGGGCCGG AAGCCGAGCC GGATGAAACC GGGTACTGGT ATATCCGTGG TGCCCTGGAA
AATGAGCCGC GCGCTGATCC GGCGGTCGCA GCCGAATTCG CATTTGAAGG CGGCCGGGTC
ATCCTGGAGC TGTCCGGCGG CGCCTTCGCC AATACGGACG CGATCTCTGA TCTGCGCTTT
TTCCCCTATC AGACCGGCCT GATCCGAAAT GCTGGCGCCC AGTCGGTCGC GACAGGCGAG
GGCAGTACCC TGGTCTTGCT GGAGCCTGGC TATGCCGTAG CCACTGCCGC CAACAGTGCT
CAAGGCGGCG TCATCACCTG GCAGGGCGCG GACGGGCAAA CGCGCCAGAG CGTGGCCATT
GAAGCGCAGC CGGGCGAGGG GGGCTATGAC CTTCCAGCCG TGGCCGGTGC GTCAGTTCCG
CAAGTCATGT CGGGCGGTAT TCTCGGGCTG GTGCTGCTGG CCTTTGGTGG CGGGCTGATC
CTCAACCTGA TGCCTTGCGT CTTTCCGGTC CTGTCGATCA AGGTGCTCAA ATTCGTCCAG
GCCGCCCATG CGGACCCCGG TGCCGTGCGG CGCCAGGGTG CCTTCTTCCT GGCTGGTGTG
CTGATCAGCT TTGTCGGACT GGCCGGCATG CTGGTGATCC TGCGTGAAGT CGGTCTCCCG
GTCGGCTGGG GTTTCCAGCT GCAAATGCCG ATCGTCGTCG CCAGCCTCGC GCTGCTGCTG
TTTGCCATTG GCTTGAATCT GCTCGGCGCC TTTGAAGTCG GGACCCGGTT GATGGGGCTG
GGCGCCGGCC TGGCTGACAA GCCGGGCTGG AAGGGCGCTT TCTTCACCGG TGTCCTCGCC
GTTGTGGTTG CCGCGCCGTG CGTCGGTCCA CTGGCCGCCG GGGCGCTGGG GCTGGCGCTG
ACCCAGCCGG CACCGGTCGT CCTGCTTGTA GCTGCCGCCA TGGGGCTGGG ACTGGCCGCG
CCCTTTGTTG TGCTGTCACT TTCGCCGGGC CTGTTGCGCT TCCTGCCCAA GCCGGGCGCC
TGGATGGTGA CCTTCCGCCA ATTCCTGGCC TTCCCGATGT TCGCATCGGT CGTCTGGCTG
GCCTGGGTGT TGTCGATCCA GTCCGGACCG ACCGGCCTGC TGCTGCTCGG CGCCGCGATG
CTGGCGCTGT CCTTTGCGGT CTGGGCGCAC GGCCAGAATG GGCGTGCCTG GAGTGTGGTT
GCATTGGTCG GACTGGCTCT TGGTGTTGCC AGCGTGGTCA TGATTGCCCG ATTGCCGGCC
ACGACCAGCA CCCAGAGCCT GTCGGCGCGA GAGGAGGCCT GGTCGCGGGC CCGTGTCGCA
GAGCTGCAGG GCATGGGACA GGCCGTGTTC GTGGATGTCA CGGCGGCCTG GTGCGTCACC
TGCCAGATCA ACAAGCTGAC GGTGCTGGGC AGCACCCCGG TCGAGGCGGC GTTCGACCGC
TTCGGTGTTG CCAGCCTGCG CGCCGACTGG ACCAATCGTG ACGAAACCAT CGCGGCCTTG
ATCAGCGAGC ATGATCAGGC CGGTGTGCCG CTCTACCTGC TCTATCCGGC TTCGGGCGGT
GCGCCGCGTG TGCTGCCGAC CGTGCTGACG ACGGGCGGGT TTGTCGATGC GCTGGAATGG
GCGGCCGACA ATTAG
 
Protein sequence
MRFLSGLSAL IALAGLVATA PASAQSSWSS GEAIIEADLV SDRSIVAPGD SFHIGLHQIM 
PEGWHTYWRN PGDNGLPVEI DWDLPSGVEI GEIVWPVPIE LPLTDTIMDY GYKGELVLPM
PVTVASDFAG EAIEFRANAT WLVCDTICVP EDRELTLTLP VGPEAEPDET GYWYIRGALE
NEPRADPAVA AEFAFEGGRV ILELSGGAFA NTDAISDLRF FPYQTGLIRN AGAQSVATGE
GSTLVLLEPG YAVATAANSA QGGVITWQGA DGQTRQSVAI EAQPGEGGYD LPAVAGASVP
QVMSGGILGL VLLAFGGGLI LNLMPCVFPV LSIKVLKFVQ AAHADPGAVR RQGAFFLAGV
LISFVGLAGM LVILREVGLP VGWGFQLQMP IVVASLALLL FAIGLNLLGA FEVGTRLMGL
GAGLADKPGW KGAFFTGVLA VVVAAPCVGP LAAGALGLAL TQPAPVVLLV AAAMGLGLAA
PFVVLSLSPG LLRFLPKPGA WMVTFRQFLA FPMFASVVWL AWVLSIQSGP TGLLLLGAAM
LALSFAVWAH GQNGRAWSVV ALVGLALGVA SVVMIARLPA TTSTQSLSAR EEAWSRARVA
ELQGMGQAVF VDVTAAWCVT CQINKLTVLG STPVEAAFDR FGVASLRADW TNRDETIAAL
ISEHDQAGVP LYLLYPASGG APRVLPTVLT TGGFVDALEW AADN