Gene Information |
Locus tag | TM1040_0569 |
Symbol | |
ID | 4076134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 605508 |
End bp | 607328 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005866 |
Product | peptidase M3B, oligoendopeptidase-related clade 3 |
Protein accession | YP_612564 |
Protein GI | 99080410 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
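The length fields above follow from the coordinates by simple arithmetic. The sketch below checks this, assuming that Start bp and End bp are 1-based inclusive positions on the replicon (the record does not say so explicitly, but the listed gene length is consistent with it) and that the CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(605508, 607328)
print(length_bp)                  # 1821
print(protein_length(length_bp))  # 606
```

Both values agree with the Gene Length and Protein Length fields of this record.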
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0425519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000722694 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCCAAC TCCCTTTCCC TGTGCGCGAT GCCAATGCCT CTTCCGGAGC CGGTGATCTT GGCAAATTGC CGGAGTGGGA TCTGAGCGAT CTCTACGCAG GCGAAGACGC TGCGGAACTT TCCCGCGATC TGGACTGGCT GCAGGGTGAA TGCGCTGCCT TTGCCGCCGA CTACGAGGGC AAGCTTGCAG AGCTCGACGC CGAGGGGCTT CTGACCTGCG TCCATCGCAA CGAAAAAATC AACAATATCG CCGGGCGCAT CATGTCCTAT GCGGGCCTGC GCTATTACCA GCTGACAACC GACGCGGACC GCGCCAAATT CCTCTCTGAC GTGCAGGAGA AAGTCACGGT CTTCACCACG CCGCTGGTGT TCTTCACGCT GGAAATCAAC CGCATCGAGG ATGCCAAACT CGACGCGCTC TTTGAGGCAA ACGCCGATCT GGCGCGCTAC AAACCCGTTT TTGACCGGAT CCGCGCGATG AAGCCCTATC AGCTCTCGGA TGAGCTGGAG CGTTTTATGC ACGACCTCGG GATTGTGGGC GATGCCTGGG AAAAGCTCTT TGACGAGACC ATCGCAGGGC TGACCTTTGA GATCGAGGGC GAAGAGCTCG GCATCGAGGC GACGCTCAAC TTCCTGACCG AGCAGGACCG CAGCAAGCGC GAAGCCGCAG CGCGCGAACT TGCCCGCGTC TTTGCCGACA ACATCAAGAT CTTTGCCCGC GTTCACAACA CGCAGGCCAA AGAGAAAGAG ATCATTGACC GCTGGCGCGG CATGCCGAGC CCGCAGATGG GCCGGCACCT CTCGAACGAT GTCGAGCCCG AAGTGGTCGA AGCTCTGCGC GAGGCCGTAG TGGCCGCCTA CCCCAAGCTT TCGCACCGCT ACTATGAGCT CAAACGCAAA TGGCTCGGTC TCGATCGCAT GCAGGTCTGG GATCGCAACG CGCCTCTGCC GATGGAAACC ACCCGCGTGG TGGACTGGGA GGAGGCCCGC GCAACCGTGA TGGAGGCCTA TGAGGCCTTT GATCCACGCA TGGGCGAGCT GGCGCAGCCG TTTTTCGACA AGGGCTGGAT CGACGCAGGC GTCAAACCCG GCAAAGCCCC CGGTGCATTT GCTCATCCAA CAGTGACCAA TGTCCATCCG TATGTGATGC TGAACTACCT GGGCAAACCG CGTGACGTGA TGACGCTGGC GCATGAACTT GGCCACGGTG TTCATCAGGT TCTTGCCGCA GATCAGGGCG AGATGCTCTC TTCGACACCC CTGACCCTTG CGGAAACCGC ATCGGTCTTT GGCGAGATGC TCACTTTCCG CAAGATGCTC GAAAAGGCCC AGACCAAAGA AGAGCGCAAG GTGCTCTTGG CAGGCAAGGT CGAGGACATG ATCAACACGG TCGTGCGCCA GATCGCCTTT TATGACTTTG AGTGCAAACT ACACGCCGCA CGCGCCGAAG GTGAGCTTAC GCCCGAAGAC ATCAACGCCC TCTGGATGAG TGTGCAAGCG GAGTCCCTGG GCGAGTCCTT TGACTTCATG GAAGGGTATG AGACCTTCTG GGCCTATATT CCGCACTTCG TCCACTCGCC CTTCTACGTC TATGCCTATG CATTTGGCGA TGGCCTCGTG AACGCGCTTT ACTCGGTCTA TGCCGAAGGG GCCGAGGGAT TTGAGGACAA GTATTTCGAC ATGCTCAAGG CTGGCGGGTC CAAACATCAC AAAGAGCTTT TGGCACCATT TGGTCTTGAT GCCTCTGACC CCAAGTTCTG GGACAAAGGC CTGTCGATGA TCTCTGGCCT GATTGACGAG 
CTTGAGGCAA TGGAAGCTTG A
|
Protein sequence | MFQLPFPVRD ANASSGAGDL GKLPEWDLSD LYAGEDAAEL SRDLDWLQGE CAAFAADYEG KLAELDAEGL LTCVHRNEKI NNIAGRIMSY AGLRYYQLTT DADRAKFLSD VQEKVTVFTT PLVFFTLEIN RIEDAKLDAL FEANADLARY KPVFDRIRAM KPYQLSDELE RFMHDLGIVG DAWEKLFDET IAGLTFEIEG EELGIEATLN FLTEQDRSKR EAAARELARV FADNIKIFAR VHNTQAKEKE IIDRWRGMPS PQMGRHLSND VEPEVVEALR EAVVAAYPKL SHRYYELKRK WLGLDRMQVW DRNAPLPMET TRVVDWEEAR ATVMEAYEAF DPRMGELAQP FFDKGWIDAG VKPGKAPGAF AHPTVTNVHP YVMLNYLGKP RDVMTLAHEL GHGVHQVLAA DQGEMLSSTP LTLAETASVF GEMLTFRKML EKAQTKEERK VLLAGKVEDM INTVVRQIAF YDFECKLHAA RAEGELTPED INALWMSVQA ESLGESFDFM EGYETFWAYI PHFVHSPFYV YAYAFGDGLV NALYSVYAEG AEGFEDKYFD MLKAGGSKHH KELLAPFGLD ASDPKFWDKG LSMISGLIDE LEAMEA
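The relationship between the two sequences above can be spot-checked by translating the first and last blocks of the gene sequence. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G
# (the layout used in NCBI genetic code listings).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGTTCCAAC TCCCTTTCCC TGTGCGCGAT"
last_blocks = "CTTGAGGCAA TGGAAGCTTG A"

print(translate(first_blocks))  # MFQLPFPVRD
print(translate(last_blocks))   # LEAMEA
```

The two outputs match the first ten and last six residues of the protein sequence. The final 21 bases are in frame because the preceding 1800 bases are a multiple of three.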