Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2313 |
Symbol | |
ID | 4078303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2434344 |
End bp | 2436134 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638007635 |
Product | peptidoglycan binding domain-containing protein |
Protein accession | YP_614307 |
Protein GI | 99082153 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACC TATTTGCCTC GGTGTTGTGG GCAGTGATCC TGTGCTTGAG TGCGCTATCG GGCGCCTCGC AGGCGCAGCA ATCCAGTGCC GGAGGCGTCT GGATTCAGGT GGCCGCGCGC CCGTCGCTGC AACAGGCACA GGAAGAGGCG CGCAATTTCG CGGCCCGCCT GCCCGATGTG TCGGGCTATG CCTTGGGCGG CGGCTGGTAT GGCATCGTGA TTGGCCCCTA TGCTCGCCCC GATGCCGAGC AGGTGTTGCG GGTCTATCGC GCTGAGGGCC AGATCCCGCG CGACAGCTTC ATCACCTTCT CAAGCAATCT GCGCAGCCGG TTTTACCCCG TTGGAGCGGA CGCCGACACA GGCACCACCG CCCCTGCGCC GCTGACCACC TCCGATGCCG AGACCGCGCC GCCAGAGCCA GCGGAGGACG CCACGTCTGA GGTGACGCCC GAGGTGGTCG TGGTGGATGA GACCCCCGCC GAAGCGCGCC AGAGCGAACG CGCCCTCACA CGCGAAGAAC GCATGGATCT GCAAATCGCG CTGAAGGCGG CAGGGTTCTA CACCTCCGGC ATCGACGGTG CCTTTGGCCG CGGCACGCGC GCCTCCATGT CTGACTGGCA GGTGGCACGC GGGTATGAGC CGACCGGCGT TCTGACCACT GCCCAGCGCC AGGCCCTGAT GGACGAGTAT AACGCGCCGC TGATTTCCAC CGGCATGGCG TCTTATACTG ACGAGCGCGC GGGCATCCGC ATGGATCTGC CCTTGGGCGA AGTGGCCTTC AGCCGTTATG AGCCGCCTTT TGCGCATTTT GACACGGCCG GAGATCTCGG CGCTCGGGTT CTGCTGATTT CGCAGCCCGG CGACCGCCGC ACGCTCTATG GTCTCTATGA CATCATGCAG ACGCTGGAGA TTGTGCCGCT CGAGGGACCA CGCGAACGCA GCGGCGACCG CTTCACGCTC GAAGGCCGCA ACAGCGACAT CGTCTCTTAT ACCGAGGCCC GCCTCGACAA TGGCGAGATC AAAGGCTTCA CGCTGGTCTG GCCCGCAGGG GACGAGGCGC GCCGCGCCCG CGTTCTTTCG GCCATGCAAG AGAGCTTTAC CCGTCTGGAG AACGTGCTCG ATCCGGCGGC CGGCATGGAT GATGCGCAGT CCATCGACCT TGTGTCGGGG CTGGAGATCC GCAAACCGCG CCTGTCGCGC TCGGGTTTCT TTGTGTCGGG CGACGGTGTG GTTGTGACCA CCGCCGATGT GGTGAACGGC TGCGGCAGCG TCACAGTAGA CCGCGATGTG AAGGCCGAGG TGCTGTTGCG CGATAGCGAG GCAGGCATTG CAGTGCTGAA ACCTTCCGAG GCACTGGCCC CGATGGCGAC TGCGCCGCTT GCGGCCAACA CGCCGCGCCT GCGCAGCGCG CTTGCCGTGT CGGGCTACTC CTACGGCGGA ATCCTTGGCG CACCAAGCCT CACATGGGGC GCGCTCAGCG ATCTCAAGGG GCTGCAAGGC GAAAGCCATC TGGCGCGTCT GGACCTGACC GCCCAGCCCG GTGATGCGGG CGGGCCGCTT CTTGGCAAGG ATGGCACCGT GCATGGCATG CTGCTGCCGG AGGCCACCAA TGGCCCCACC CTGCCCGAGG GCGTGCGCTT TGCCCTAAAG GGCGAGGTCA TCCGCATGGC GCTCGAACAG GCTGGCGTGG GCTTGCCCGC GACCGAGGCC GCCGCCACCG GCGAGCTGCC CATCTCGATC CTGCAAAAAG AGGCCACTGG CCTCACCACG CTGGTCAGCT GCTGGGAGTA A
|
Protein sequence | MKNLFASVLW AVILCLSALS GASQAQQSSA GGVWIQVAAR PSLQQAQEEA RNFAARLPDV SGYALGGGWY GIVIGPYARP DAEQVLRVYR AEGQIPRDSF ITFSSNLRSR FYPVGADADT GTTAPAPLTT SDAETAPPEP AEDATSEVTP EVVVVDETPA EARQSERALT REERMDLQIA LKAAGFYTSG IDGAFGRGTR ASMSDWQVAR GYEPTGVLTT AQRQALMDEY NAPLISTGMA SYTDERAGIR MDLPLGEVAF SRYEPPFAHF DTAGDLGARV LLISQPGDRR TLYGLYDIMQ TLEIVPLEGP RERSGDRFTL EGRNSDIVSY TEARLDNGEI KGFTLVWPAG DEARRARVLS AMQESFTRLE NVLDPAAGMD DAQSIDLVSG LEIRKPRLSR SGFFVSGDGV VVTTADVVNG CGSVTVDRDV KAEVLLRDSE AGIAVLKPSE ALAPMATAPL AANTPRLRSA LAVSGYSYGG ILGAPSLTWG ALSDLKGLQG ESHLARLDLT AQPGDAGGPL LGKDGTVHGM LLPEATNGPT LPEGVRFALK GEVIRMALEQ AGVGLPATEA AATGELPISI LQKEATGLTT LVSCWE
|
| |