Gene Information |
Locus tag | Mmar10_2500 |
Symbol | |
ID | 4286626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2721521 |
End bp | 2722822 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638141998 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_757724 |
Protein GI | 114571044 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
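The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2721521
end_bp = 2722822
protein_length_aa = 433

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1302
print(gene_length_bp % 3)   # 0, so the length is a whole number of codons
print(expected_protein_aa)  # 433
```

Under these assumptions the computed values agree with the reported Gene Length (1302 bp) and Protein Length (433 aa).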
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCGATC GCAAGATCGA GTTCAAGAAA CTGACCCAGG ATGAAGTCGA CGCGATTTGC GCCCGGCACG AACGCCTGTG GTCGGGGCGC TCCGGTGGCG CACGGGCCAA TTTCTCCTAC ACCATTCTCG AAAACATGGT GCTGGCCAAT CGGGACCTGT CCGATGCCGA CTTCAGCGGT GCCGTGTTGC GCGGTGCCGT CCTGTCCGGA GCCACGCTCA ATAGCGCGGT CTTTTTCGCC GCCGATTTGC GGCGTGCCAA TCTGGAAGGC GCGTCCCTGA GACGGGCGGA CTTGCGCGGT GCCATCATGC GGGGCGCCAA CCTAACCGGG GCTGACCTGA CCGAGGCTGA TATGCGCGAA GGTGCGATCG CCCAAATGGA CAAGGAAAAG GGCCTCGCGG TCCTGACCCA CAAAACCCAG GAAAACGATG CCGGGACCGC CAATTTCACG GGGGCCAATC TCTCGCACTC GAAAATGGCC GGAATCGCCG CCCAAAAAGC CGACTTCACT GACGCCATGC TGCTGGGCAC CCGTTTGATC CGGGCCAATC TGCGCGGATC ATCCTTTCAA GGCGCCAATC TCGAAGGCGC AGACATGTCC GGCGCGGACC TGTCCGATAC TGATTTCACG CATGCCGTTT TGATCGGCAC GAAGACCGTG ATGGCCAAGA CCCAGGGCAT GAACACCGAC GGCGCGTTGA CCAATGCGGC GGTCGGTATT GATCCTGAAG CTCTGGAACA GCCTGTTGCC GACCAAATCG CCGAACATGT CCGCTGGTGT GAAACCAATG GCAAGGAAGG CAAGCCATCC CTGTTCGACG GGACGGATCT CCGTGGCCTG ACATCCCTCG CCGGGAAAAC ACTGACCGCC TTTCATGCGC GTGGCGCGAC ACTTTACGGC CTGGACCTGG AGGGCGCGCA GCTGCAGGGC GCTCGCCTTG AAAAAGCCGA CCTGCGGATG GTCAGCCTGC GTGGCGCGGA CCTGCGTGGA GCCGATCTTA CCGGCGCCAA CCTTTCAAAT GCCGACCTGC GTGACTGCCA ACTTGGGCCG CTTCTGCTCG ATGGCAACCG GGTTCTTCCG GCCTCGCTCG CCGGGGTGCG GGCACGCTAT ACCGATCTGC GGGGCGCCGA CCTGCGTCAG TTGAAGGCGA CCGGCGCCGA TTTTTCCTAC GCCACCCTCA ACGGGACCAA TGTGAAAAGG GCGAGCTTCA ATGGCTCCTG CTTCACGGGT GCCCGTATCG ACGAGGACTT CTCCACCCAG GTTGACTCCC TGGACGGCGC CGTCGATCTC GACGCGGCCT GA
|
Protein sequence | MTDRKIEFKK LTQDEVDAIC ARHERLWSGR SGGARANFSY TILENMVLAN RDLSDADFSG AVLRGAVLSG ATLNSAVFFA ADLRRANLEG ASLRRADLRG AIMRGANLTG ADLTEADMRE GAIAQMDKEK GLAVLTHKTQ ENDAGTANFT GANLSHSKMA GIAAQKADFT DAMLLGTRLI RANLRGSSFQ GANLEGADMS GADLSDTDFT HAVLIGTKTV MAKTQGMNTD GALTNAAVGI DPEALEQPVA DQIAEHVRWC ETNGKEGKPS LFDGTDLRGL TSLAGKTLTA FHARGATLYG LDLEGAQLQG ARLEKADLRM VSLRGADLRG ADLTGANLSN ADLRDCQLGP LLLDGNRVLP ASLAGVRARY TDLRGADLRQ LKATGADFSY ATLNGTNVKR ASFNGSCFTG ARIDEDFSTQ VDSLDGAVDL DAA
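The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any computation. The following is a minimal sketch, assuming the sequence is pasted in as a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced for illustration, and the short example uses only the first two blocks of the gene sequence, so it does not reproduce the GC content reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
example = "ATGACCGATC GCAAGATCGA"

print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # 0.5
```

Passing the full gene sequence to `gc_fraction` would give a value that can be compared with the GC content field in the Gene Information table.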
|