Gene Information |
Locus tag | Mmar10_1239 |
Symbol | |
ID | 4286404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1348917 |
End bp | 1350398 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638140718 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_756469 |
Protein GI | 114569789 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.101838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0181716 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCAT TGATTTCGAT TTCGATCATG GGACTTTCGT TGCTGACCTC GACCGCTGCG GTCGAGGCGC AGGCCAATCA GGAGCGGGTG ATCCCCGAAC ATATTCTTGT CCGGGTCATG CCGCGAGATC TGGTGGGGGT CGATTTGAGC GGCGCCGCGC TGGCAGATGT AAGCTTCGAG CGGGTTACCA TGTCCGGCGC GAACATGTCC GGCGCAAATC TCAGCCGATC CCGGTTTCCG GATGCCAATC TGGACCGCGC GGATCTCTCG AATACCGATC TGACCGAGGC TGACCTCTCC ACGGGAAGAT TTGTCGGCGC AAACTTTCGA GGTGCTTTGC TGCGCAACAC CTCCCTGACC GGTGCCAATC TCACCGGCGC GGATCTGACA GGGGCCCGGG AACTGGGCTA TGAGATCAAT CAGGCCCGCC TTTGCAATAC TCGCCTGAGC GCAACAGTGG TCCTCAATCG CGATTGTGTG GAGCTTGGAG TGCGTGCCGA GACGCGCCAT CTCGTTGATC GCAACGCCCA GCAGCGTCTC CTGCAAAACC GGGCCTGTGT CGGCTGTGAT CTTCGCTTTG CCCAGGCCGC CAACGCGTCG CTGATTGGCG CCGATATCTC CCGCGCCCTG CTGACCGATG CCAATCTGTC GGGCGCGAAC CTGTCTGGGG CCAACCTCGT CGACGCCGAG CTTACCCGCA CCAATTTTGC GCGTGCCAAT CTGTCGAATG TGAGTTTCTC CGGCCGTCCA TTGCCGCGCA ATGTGATATT CACCGGCACC ATTCTTCGTG GCGCCAATCT TTCTGGCCAA GACCTCACGG GGCTCGACTT TCAGGGCGCC GACCTGTCCC GAGCCAATGT CGAGGCGGCC GAAATCGATC GCACGACATT GCTCCGGGAT GCCCGCCTGA CCGGACTGGA CCTGAGCCAT GCCTCCTTGA GCCAGGCGAT TTTTCCGGGA AATGACTTTC GCACCATCAA CCTGACCGGT GTGCAGATTT ACGGGATGGT CCTGACCGGC GCCAATTTCA CCTCTGTCGA GTTGTCGAAT GCCCGGATTG TTGAGAGCAA CATGCAACGG GCCATCCTTG CCGGTGCCAA TCTGTCCTAC GCCGACCTTT CCGGGATTGA CCTTGCTGGC GCCGATCTGA CCGGTGCCGA CTTGAGTGGA GCCAACTTGA TCGGCGCTGA TCTGACAGGC GCCAACCTGA CTCGCGCCAA CCTGACAGGG GCCATCCTGT TCGGGACCGA TCTGACGCGT GCGATCCTGG CCAATGCCCG GTTGAACTCG GCTCAACTGG TCGGCGCCCA GCTAAGCGGC GCCCGGCTCG ACTCGGCGGA CCTGACGGAT GCCAATCTTT TTGGCGCGCA GAACGCGGCC AGCATCCCGG TCTCCGGCAC CATGACCTTC TGCCGCACCA GGATGGCAGA CGGGAGTGAC CGCAGCTCCA GCTGTGGGGC TGCGGTTTCG TCGCCGAAAT AG
Protein sequence | MRALISISIM GLSLLTSTAA VEAQANQERV IPEHILVRVM PRDLVGVDLS GAALADVSFE RVTMSGANMS GANLSRSRFP DANLDRADLS NTDLTEADLS TGRFVGANFR GALLRNTSLT GANLTGADLT GARELGYEIN QARLCNTRLS ATVVLNRDCV ELGVRAETRH LVDRNAQQRL LQNRACVGCD LRFAQAANAS LIGADISRAL LTDANLSGAN LSGANLVDAE LTRTNFARAN LSNVSFSGRP LPRNVIFTGT ILRGANLSGQ DLTGLDFQGA DLSRANVEAA EIDRTTLLRD ARLTGLDLSH ASLSQAIFPG NDFRTINLTG VQIYGMVLTG ANFTSVELSN ARIVESNMQR AILAGANLSY ADLSGIDLAG ADLTGADLSG ANLIGADLTG ANLTRANLTG AILFGTDLTR AILANARLNS AQLVGAQLSG ARLDSADLTD ANLFGAQNAA SIPVSGTMTF CRTRMADGSD RSSSCGAAVS SPK
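The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1348917
END_BP = 1350398
REPORTED_GENE_LENGTH_BP = 1482
REPORTED_PROTEIN_LENGTH_AA = 493


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    length_aa = protein_length(length_bp)
    # Compare the derived values with those reported in the record.
    print("gene length matches:", length_bp == REPORTED_GENE_LENGTH_BP)
    print("protein length matches:", length_aa == REPORTED_PROTEIN_LENGTH_AA)
```

Under these assumptions both derived values agree with the record: 1350398 - 1348917 + 1 = 1482 bp, and 1482 / 3 - 1 = 493 aa.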