Gene Information |
Locus tag | Mmar10_2022 |
Symbol | |
ID | 4285476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 2204225 |
End bp | 2205604 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638141523 |
Product | protease Do |
Protein accession | YP_757252 |
Protein GI | 114570572 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
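The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2204225
END_BP = 2205604
STOP_CODON_BP = 3  # assumption: the CDS ends with one stop codon


def gene_length_bp(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_BP) // 3


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1380, matching "Gene Length"
print(protein_length_aa(length))  # 459, matching "Protein Length"
```

Under these assumptions both derived values agree with the table, which supports the inclusive-coordinate reading. Since the gene is on the minus strand, the Start bp value would mark the 3' end of the coding sequence on the replicon.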
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.303504 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGTT TCCTAGTTCT CATCCTCTTC CTGGTGGCGA CACCGGTCAG CCAGGCTCAT CCGGATGGTT TCGCCGATCT GGCCGAGCGC CTGTCGCCGG CTGTGGTGAA TATTTCGGCC GCCCAGCGGC TCGACAGCGA GGACGGTCTG CCGGAATTCC CGGAGGGTTC GCCGCTGGAA CGCTTCAACG ATATTTTCGG CTCGGCGCCG CGGATCGCCA ACTCGCTTGG TTCGGGCTTC ATCATCGACG CCTCCGGCCT GGTGGTGACC AATAATCACG TCATCGACGG GGCTGACGAG GTTGAGGTGT CGCTGCCGGA TGGTCGCGTC TTCCGCGCCG AAGTGGTCGG CATCGACAGT GTCACTGACC TCGCCGTCCT GCGCATGGAG GTCAATGAGC CCATGCCCTT CGTCGCCTTT GGCGACAGCG ATGCGGCACG GGTCGGCGAC TGGGTGATTG CGATCGGCAA TCCGTTCGGA CTGGGCGGCA CGCTGACCGC CGGCGTCGTC TCAGCCCGGG GCCGCGAGGC GGGTGGTCGC TATGACGACT ATATCCAGAC CGATGTTGCC ATCAATACCG GCAATTCCGG CGGTCCGTTG TTCAACATGG ATGGCGAAGT GGTCGGCGTG AACACGCTGA TCCTGTCACC GACCGGGGCC AGTGTCGGCA TTTCGCTCTC CATCCCGTCC AACCTGGCCA ATGTGGTCGT CAACCAGCTG GTCGAGTTCG GTGAGACCCG TCGCGGCTGG CTTGGCGTGT CGGTCCAGCG CGTGACGCCG GAACTGGCCG AAAGCTTTGA ACTGGCGACG CCCTATGGCG CCATTGTCTC GCGCATCGAG GAGGATGGTC CGGCGGCGGA TAGTGGTATC CGGACCGGGG ATCTGGTGCT GGCCTTTGAT GGCCGCCGGG TCCGTGACAG CCGTTCCTTC CCGCGCATGG TCGCCGAAAC CGAGATCGGC CGCGAGATCG AGCTCGACAT CATCCGCCGT GACCGGCCGA TGACGATTAA TATCACCGTT GGCAACCTGG CCGAGGACGA TGCCGACGGC GAAGGCGAGA CAGCCAATGC CGCAGTGGTC GCACCGGTGA CGGGCAGCGG CAATACCGTG ATGGGCATGA CATTCGGCAC GCTGGACGCC GCCTCCCGCC GCCGTTTCCG GGTCCATCCG GATGCCGAGG GGGTTCTGGT GACCGAGGTT GACCAGACCA GTGACGCGTC CGGCAAGGTT CGTCCGGGTG ATGTCATCGA GGAAGTCGAG TTCACGCGCG TCGACAGCAT TGCCGCCATC CGGGAGATCG TTGAGGGGCA GGGCAATGGG CCGGTGCGCT TCCAGATCAA TCGCGGCGGC CAGTATGTGC TGCAATCGAT CCGGTCGTGA
|
Protein sequence | MQRFLVLILF LVATPVSQAH PDGFADLAER LSPAVVNISA AQRLDSEDGL PEFPEGSPLE RFNDIFGSAP RIANSLGSGF IIDASGLVVT NNHVIDGADE VEVSLPDGRV FRAEVVGIDS VTDLAVLRME VNEPMPFVAF GDSDAARVGD WVIAIGNPFG LGGTLTAGVV SARGREAGGR YDDYIQTDVA INTGNSGGPL FNMDGEVVGV NTLILSPTGA SVGISLSIPS NLANVVVNQL VEFGETRRGW LGVSVQRVTP ELAESFELAT PYGAIVSRIE EDGPAADSGI RTGDLVLAFD GRRVRDSRSF PRMVAETEIG REIELDIIRR DRPMTINITV GNLAEDDADG EGETANAAVV APVTGSGNTV MGMTFGTLDA ASRRRFRVHP DAEGVLVTEV DQTSDASGKV RPGDVIEEVE FTRVDSIAAI REIVEGQGNG PVRFQINRGG QYVLQSIRS
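The sequence fields are printed in blocks of ten characters, so the spaces need to be removed before any computation. The following is a minimal sketch of how the GC content and the translation could be re-derived from the gene sequence. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for amino acid assignments (table 11 differs in the permitted start codons). The code assumes the gene sequence is already given in coding orientation, which is consistent with it beginning with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; also used by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = clean_sequence("ATGCAGCGTT TCCTAGTTCT CATCCTCTTC")
print(translate(first_blocks))  # MQRFLVLILF
```

Running `translate` on the full cleaned gene sequence and comparing the result with the cleaned protein sequence, and comparing `gc_fraction` against the reported GC content, would be a reasonable consistency check for the record.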