Gene Information |
Locus tag | Mmar10_1665 |
Symbol | |
ID | 4286849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1826375 |
End bp | 1829647 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638141153 |
Product | hypothetical protein |
Protein accession | YP_756895 |
Protein GI | 114570215 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
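The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length) and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(1826375, 1829647)
print(length_bp)                  # 3273
print(protein_length(length_bp))  # 1090
```

Both results match the Gene Length (3273 bp) and Protein Length (1090 aa) fields in the record.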
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.193209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGAA TCATCTGTGC CGTCCTGCTC TTTGCGCTGC CACAAACCGC GTGCGCGCAA TCGGAGAGCG AGACCGAAAC CGGCCCGTTG ACCTCCGAGA ACTTCGCAGG CTTCGAGTTC CGCAGTATCG GGCCCGCTTT CATGTCGGGA CGTATCGCCG ATATCGAAAT CATGCCGGAT GATCCCTCGA CCTGGATCGT CGGCGTCGGT TCGGGCGGGG TCTGGCGCAC GGACAATGCC GGCACGACCT GGTCGTCATT ATTCGATGAT GAAGGCGTCT ATTCCATCGG CACGGTCACG GTCGACCCGT CCAATCCCCA TACGATCTGG GTCGGCACCG GAGAGAATCA CGGCGGCCGG CACCTCGGCT ATGGCAATGG CGTCTATCGC TCCCGCGATG GCGGCGACAG CTGGACCCAT CTGGGGCTTG AATCTTCCGA GCACATTTCG GAGATCCTGA TCCACCCGGA AGACCCCGAT ACGATCTGGG TCGCCTCGCA GGGGCCGCTT TGGTCGCCGG GCGGCGAGCG CGGCCTCTTC AAATCGACCG ATGGCGGCGA GACCTGGACA CTGGTGCTGT CGGCGGGCGA ATGGACCGGC GTGACCGACA TCGTCATGGA CCCCCGCAAT CCCGACCGTC TGGTCGCGGC GACCTGGCAG CGCCACCGGA CGGTGGCCGC CTATATGGGC GCCGGTCCGG AGAGCGGGCT GCACGTGTCT GACGATGGTG GCGATACCTG GCGCGAAGTC ACCGCCGGAT TGCCGAGCGG CAATATGGGC AAGATCGGCC TGGCGATCTC GCCGCAAAAC CCTGACGTGC TCTATGCGGC GATCGAACAG GACCGCCGCA CGGGCGGGGT CTACCGGTCC AGCAATCGCG GCGAGAGCTG GACCCGCATG TCCGACACGG TCTCGGGCGG AACGGGCCCG CACTACTACC AGGAGCTCTA CGCCCACCCG CATCATTTCG ATCGGATCAT CCTGGTCTCC AACACCACCC AGATCTCCGA GGATGGCGGC GCAACATTCC GCGGCCTGAA CAATGAGACC AAGCATGTCG ATGACCATGC CATCGGCTTC CACCCGACCG ATCCGGACTA TCTGCTGGTT GGCTCGGACG GCGGTCTGTA CGAGAGCTGG GACGGCGATG CGACCTGGCG CTTCATCTCC AACCTGCCGA TCACGCAATT CTACGACATC GCCCTGGACG ACGCCGAGCC CTTCTACAAT GTCTATGGCG GCACCCAGGA CAATAATACC CAGATGGGCC CCTCGCGAAC GGACAGCCGG CACGGCATCC GCAATTCGGA CTGGGTCGTG ACCCTGTTCG GCGATGGTCA TGAACCCGAT GTCGAGCCCG GCAATCCGGA CATTGCCTAT TCAAGCTGGC AGCAGGGAAA TCTGGTCCGC TTCGATCGCA CGACAGGCGA GCTGGTCTAT GTTCGTGCCC AGCCCCAGCC CGGCGAACCG GCCGAGCGCT TCAACTGGGA TGCGCCGCTG GTGGTGTCCT CGCATCAGCC GACCCGGCTC TATCACGCCT CGCAACGCGT CTGGCGCTCG AATGATCGCG GCGACAGCTG GACACCCCTC TCCGGTGACC TGACCCGTGA CCAGGACCGC ATGTTGCTGC CGATCATGGG GCGGCAATGG TCCTGGGATG CCGGCTGGGA CATCTATGCG ATGTCGGTCT ACAACACGAT CAGCGCGCTG GCGGAGTCGC CTGTCGACGA GAATGTGATC TATGCCGGTA CCGATGACGG TCTGGTCCAG ATCACGGTCG ACGGCGGCGA CAACTGGCGG 
CGGACCGAAG CCGGCGACCT GCCGGGCGTG CCCGACCTCG CCTATATCAA TGATTTCGAG CCATCCCGTT TTGACGCCAA CACGGTCTAT ATGGCGCTCG ACAACCACAA ATACGGTGAT TTCACCCCCT ATCTCCTGCG CTCGGACAAT CAGGGCCGCA GCTGGCGCAT GATCACCGAC GGATTGCCAG AGAATGGCCC TGTCTGGCGC ATCGTCCAGG ACCACGAGAA TCCGGACCTG CTCTTCATCG GTACGGAATT CGGTGTCTAT TTCACCCTCG ATGGCGGCGA CAACTGGACC CAGCTGACCG CCGGCATGCC GCCCATTCCC GCCCGCGATC TTTTGATCCA TGAGCGCGAG GACGATCTTG TAGTCGGCAC GTTCGGCCGG TCCATCTATG TCCTCGACGA TATTGCCCCC CTGCGCGACC TCTCGGTCGA CTCGCTCGAG ACCGACACGC GACTTTTCCC GGTCCGCCGG GCCTGGTGGT ATCAGGAACA ACACGAGCTT GGATTCGGCT TCAGGGCGTC TCAGGGCGAC GGCTATTTCC AGGCCGAGAA CCCGGCCTTC GGGGCCCTCA TCACCTATTA TGTCAGCGAG GGCCTGCAGA CGTCCGAAGA GGCGCGCCAG GAGATGGAAA GCCCGTTGGT CGAAGCGGGC GAGGATACAC CCTTCCCCGG CTTCGACGTG ATCGAATCGG AACGACGTGA AGCGCCGCCG CGTTTCTGGC TGGTCATTCG CGACAGCGCC GGTGATGTGG TGCGGCGTCT GCCCGGGCCT GTTTCGTCCG GCTTCCACCG TGTCACCTGG GATCTCACCT ATCCGAGCTC CGACGCGGTG ACCACGCCCG ATACGGCAGA TTCCGAGACA TCGGGCTATC TCGTCGCTCC GGGCCGTTAC TCGGTCGAGC TGGTGCGACA GAGTAATGGC GAAACCGTCG TAATGGGTGA GGCCCAGACG ATCGAGGTTG AACGCCTCAC CGATGGCGCC CTGCCCGGCT CGCCGGATGA CGAGGTGGTT GCCTTCTGGG AGCGGCTTGC CTCGATGCAG CGTCAGGTCT CTGCGGCCTC GGCGTCCATC GAGTTGACCC ATGCGCGCAT CGCACGCCTG CAATCAGCGC TTTATCGAAC GCGTACGGCG CCGGGTACGC TTGATGACCG CTACGAAGCC CTGAGACAGG AATTGTTCGC CATTGAGGAA GCTCTGGGCG GCAATCAGAC CATGGCGGGC CGCCATGGCG CGCAAGCGTC CACGGTCGGC GCACGGCTAT CCTTCGCGCA ATTGGGAACC GGCAATTCGA GCTACGGCCC TTCGCCGTCC CATGAAGCAC AATTGCAGAT GGCGGAAGAA GAAATGGCCG GCATTCGTGA ACGGCTGGCC AATCTGACCG AGGCGGCCAT TCCGGCGTTC GAGGCTGACC TGGCCGCGGT CGATGCGCCA TGGACGCCGG GCACGCCAAT GCCGCCCTGG TGA
|
Protein sequence | MRRIICAVLL FALPQTACAQ SESETETGPL TSENFAGFEF RSIGPAFMSG RIADIEIMPD DPSTWIVGVG SGGVWRTDNA GTTWSSLFDD EGVYSIGTVT VDPSNPHTIW VGTGENHGGR HLGYGNGVYR SRDGGDSWTH LGLESSEHIS EILIHPEDPD TIWVASQGPL WSPGGERGLF KSTDGGETWT LVLSAGEWTG VTDIVMDPRN PDRLVAATWQ RHRTVAAYMG AGPESGLHVS DDGGDTWREV TAGLPSGNMG KIGLAISPQN PDVLYAAIEQ DRRTGGVYRS SNRGESWTRM SDTVSGGTGP HYYQELYAHP HHFDRIILVS NTTQISEDGG ATFRGLNNET KHVDDHAIGF HPTDPDYLLV GSDGGLYESW DGDATWRFIS NLPITQFYDI ALDDAEPFYN VYGGTQDNNT QMGPSRTDSR HGIRNSDWVV TLFGDGHEPD VEPGNPDIAY SSWQQGNLVR FDRTTGELVY VRAQPQPGEP AERFNWDAPL VVSSHQPTRL YHASQRVWRS NDRGDSWTPL SGDLTRDQDR MLLPIMGRQW SWDAGWDIYA MSVYNTISAL AESPVDENVI YAGTDDGLVQ ITVDGGDNWR RTEAGDLPGV PDLAYINDFE PSRFDANTVY MALDNHKYGD FTPYLLRSDN QGRSWRMITD GLPENGPVWR IVQDHENPDL LFIGTEFGVY FTLDGGDNWT QLTAGMPPIP ARDLLIHERE DDLVVGTFGR SIYVLDDIAP LRDLSVDSLE TDTRLFPVRR AWWYQEQHEL GFGFRASQGD GYFQAENPAF GALITYYVSE GLQTSEEARQ EMESPLVEAG EDTPFPGFDV IESERREAPP RFWLVIRDSA GDVVRRLPGP VSSGFHRVTW DLTYPSSDAV TTPDTADSET SGYLVAPGRY SVELVRQSNG ETVVMGEAQT IEVERLTDGA LPGSPDDEVV AFWERLASMQ RQVSAASASI ELTHARIARL QSALYRTRTA PGTLDDRYEA LRQELFAIEE ALGGNQTMAG RHGAQASTVG ARLSFAQLGT GNSSYGPSPS HEAQLQMAEE EMAGIRERLA NLTEAAIPAF EADLAAVDAP WTPGTPMPPW
|
| |
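The sequence fields are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate codons into residues. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG).

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate complete codons in frame 1; a trailing partial codon is ignored."""
    seq = clean_sequence(text)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 12 bases of the gene sequence (the block spacing is tolerated).
print(translate("ATGCGTAGAA TC"))  # MRRI
# Last two codons of the gene sequence: tryptophan followed by the stop codon.
print(translate("TGGTGA"))         # W*
print(gc_fraction("ATGCGTAGAA"))   # 0.4
```

The first four residues, MRRI, agree with the start of the Protein sequence field, and the final codon TGA is a stop, which is why the protein is one residue shorter than the codon count.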