Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0597 |
Symbol | |
ID | 4286885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 690291 |
End bp | 692405 |
Gene Length | 2115 bp |
Protein Length | 704 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638140062 |
Product | cytochrome c biogenesis protein, transmembrane region |
Protein accession | YP_755828 |
Protein GI | 114569148 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein [COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.00681901 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTTTTC TGAGTGGGCT TTCGGCCCTG ATCGCTCTGG CCGGATTGGT GGCGACGGCC CCGGCTTCCG CGCAATCCTC GTGGTCGAGT GGCGAAGCCA TCATCGAAGC CGATCTGGTC TCCGATCGTT CGATTGTGGC ACCGGGTGAC AGCTTTCACA TCGGTCTGCA CCAGATCATG CCGGAGGGGT GGCACACCTA TTGGCGCAAT CCGGGCGATA ACGGCCTGCC GGTCGAGATC GACTGGGACC TGCCGTCCGG CGTCGAGATC GGCGAGATTG TCTGGCCTGT CCCGATCGAA CTGCCGCTGA CCGACACGAT CATGGACTAT GGCTATAAGG GCGAGTTGGT CCTGCCGATG CCGGTAACGG TGGCCAGCGA TTTTGCCGGC GAGGCGATCG AGTTTCGGGC CAATGCGACG TGGCTGGTCT GTGACACGAT CTGTGTGCCA GAAGACCGCG AGCTTACCCT GACCCTGCCG GTCGGGCCGG AAGCCGAGCC GGATGAAACC GGGTACTGGT ATATCCGTGG TGCCCTGGAA AATGAGCCGC GCGCTGATCC GGCGGTCGCA GCCGAATTCG CATTTGAAGG CGGCCGGGTC ATCCTGGAGC TGTCCGGCGG CGCCTTCGCC AATACGGACG CGATCTCTGA TCTGCGCTTT TTCCCCTATC AGACCGGCCT GATCCGAAAT GCTGGCGCCC AGTCGGTCGC GACAGGCGAG GGCAGTACCC TGGTCTTGCT GGAGCCTGGC TATGCCGTAG CCACTGCCGC CAACAGTGCT CAAGGCGGCG TCATCACCTG GCAGGGCGCG GACGGGCAAA CGCGCCAGAG CGTGGCCATT GAAGCGCAGC CGGGCGAGGG GGGCTATGAC CTTCCAGCCG TGGCCGGTGC GTCAGTTCCG CAAGTCATGT CGGGCGGTAT TCTCGGGCTG GTGCTGCTGG CCTTTGGTGG CGGGCTGATC CTCAACCTGA TGCCTTGCGT CTTTCCGGTC CTGTCGATCA AGGTGCTCAA ATTCGTCCAG GCCGCCCATG CGGACCCCGG TGCCGTGCGG CGCCAGGGTG CCTTCTTCCT GGCTGGTGTG CTGATCAGCT TTGTCGGACT GGCCGGCATG CTGGTGATCC TGCGTGAAGT CGGTCTCCCG GTCGGCTGGG GTTTCCAGCT GCAAATGCCG ATCGTCGTCG CCAGCCTCGC GCTGCTGCTG TTTGCCATTG GCTTGAATCT GCTCGGCGCC TTTGAAGTCG GGACCCGGTT GATGGGGCTG GGCGCCGGCC TGGCTGACAA GCCGGGCTGG AAGGGCGCTT TCTTCACCGG TGTCCTCGCC GTTGTGGTTG CCGCGCCGTG CGTCGGTCCA CTGGCCGCCG GGGCGCTGGG GCTGGCGCTG ACCCAGCCGG CACCGGTCGT CCTGCTTGTA GCTGCCGCCA TGGGGCTGGG ACTGGCCGCG CCCTTTGTTG TGCTGTCACT TTCGCCGGGC CTGTTGCGCT TCCTGCCCAA GCCGGGCGCC TGGATGGTGA CCTTCCGCCA ATTCCTGGCC TTCCCGATGT TCGCATCGGT CGTCTGGCTG GCCTGGGTGT TGTCGATCCA GTCCGGACCG ACCGGCCTGC TGCTGCTCGG CGCCGCGATG CTGGCGCTGT CCTTTGCGGT CTGGGCGCAC GGCCAGAATG GGCGTGCCTG GAGTGTGGTT GCATTGGTCG GACTGGCTCT TGGTGTTGCC AGCGTGGTCA TGATTGCCCG ATTGCCGGCC ACGACCAGCA CCCAGAGCCT GTCGGCGCGA GAGGAGGCCT GGTCGCGGGC CCGTGTCGCA GAGCTGCAGG GCATGGGACA GGCCGTGTTC GTGGATGTCA CGGCGGCCTG GTGCGTCACC TGCCAGATCA ACAAGCTGAC GGTGCTGGGC AGCACCCCGG TCGAGGCGGC GTTCGACCGC TTCGGTGTTG CCAGCCTGCG CGCCGACTGG ACCAATCGTG ACGAAACCAT CGCGGCCTTG ATCAGCGAGC ATGATCAGGC CGGTGTGCCG CTCTACCTGC TCTATCCGGC TTCGGGCGGT GCGCCGCGTG TGCTGCCGAC CGTGCTGACG ACGGGCGGGT TTGTCGATGC GCTGGAATGG GCGGCCGACA ATTAG
|
Protein sequence | MRFLSGLSAL IALAGLVATA PASAQSSWSS GEAIIEADLV SDRSIVAPGD SFHIGLHQIM PEGWHTYWRN PGDNGLPVEI DWDLPSGVEI GEIVWPVPIE LPLTDTIMDY GYKGELVLPM PVTVASDFAG EAIEFRANAT WLVCDTICVP EDRELTLTLP VGPEAEPDET GYWYIRGALE NEPRADPAVA AEFAFEGGRV ILELSGGAFA NTDAISDLRF FPYQTGLIRN AGAQSVATGE GSTLVLLEPG YAVATAANSA QGGVITWQGA DGQTRQSVAI EAQPGEGGYD LPAVAGASVP QVMSGGILGL VLLAFGGGLI LNLMPCVFPV LSIKVLKFVQ AAHADPGAVR RQGAFFLAGV LISFVGLAGM LVILREVGLP VGWGFQLQMP IVVASLALLL FAIGLNLLGA FEVGTRLMGL GAGLADKPGW KGAFFTGVLA VVVAAPCVGP LAAGALGLAL TQPAPVVLLV AAAMGLGLAA PFVVLSLSPG LLRFLPKPGA WMVTFRQFLA FPMFASVVWL AWVLSIQSGP TGLLLLGAAM LALSFAVWAH GQNGRAWSVV ALVGLALGVA SVVMIARLPA TTSTQSLSAR EEAWSRARVA ELQGMGQAVF VDVTAAWCVT CQINKLTVLG STPVEAAFDR FGVASLRADW TNRDETIAAL ISEHDQAGVP LYLLYPASGG APRVLPTVLT TGGFVDALEW AADN
|
| |