Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1013 |
Symbol | |
ID | 4078233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1083921 |
End bp | 1085192 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006317 |
Product | OsmC-like protein |
Protein accession | YP_613008 |
Protein GI | 99080854 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [COG1765] Predicted redox protein, regulator of disulfide bond formation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.325781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCG AACGCATCAG CTTTGCCGGC CATGCCGGAC ATGACCTCGC CGCGCGCCTC GACCTGCCCG AAGGGCCGGT GTTGGCCACC GCCCTTTTTG CCCATTGTTT TACCTGCTCC AAAGACATCC CCGCCGCCCG GCGTATTGCC GCGCGTCTGG CCGCCATGGG GATCGCGGTG CTGCGATTTG ATTTCACCGG GCTGGGTCAT TCGGGCGGCG AGTTTGCAAA CACCAGCTTC ACCTCCAATG TCGCCGATCT GATTGCGGCT GCGCGCTATC TGGCCAGCCG CAACATGGCG CCAGATATGC TGATCGGACA TTCGCTGGGC GGTGCGGCGG TGCTGCGCGC GCGGGCAGGT ATCCCCTCGG TCAAAAGCGT GGTGACGCTG GGTGCGCCCT TTGATCCGGG TCATGTGGCG CATCATTTCG AGGATGCGCT GGAGGAAATC AACCGCACGG GCCGGGCCGA AGTAAACCTT GGCGGGCGGC CCTTTGTGAT CGGCAAGGAA TTTGTCGATG ACATCGGCCA GACGGAACTT GGAGAGGCGA TCTCGGACCT CAGGGCCGCA CTTCTTGTGA TGCATGCGCC GCGGGATGCC ACGGTTTCGA TCGACAACGC GGCCGAGATC TTTGGTGCGG CGCGCCACCC CAAGAGCTTT GTCACCCTTG ATGATGCCGA TCACCTGATC ACCGATCCCT GTGATGCCGA ATATGCAGCA GATATGATTG CCACCTGGGC CACGCGGTAT GTAGACATGA AGCCCCCCGC GCCCCCGCCC GGCGCCCCAG AGGGTGTCAT CCGCGTCACC GAGGCGGACC CGCAGGGGTT CTTGCAGGAT GTGCAGAACG GCCCCTACCA TCATCTGCTG GCTGACGAGC CCGAAGCCTA TGGTGGCACC AACCGGGGCC TGTCACCCTA TGGGTTTGTG GCCGCCGGCC TTGGGGCCTG CACGTCGATG ACCTTGCGGA TGTATGCCCG GCGCAAAGAC TGGCTCCTCG AGGGGATCAG CGTCGAGGTC TGCCACGACA AGGTCCATGC GCAGGATGCC ATCCCCTCTG GCCCTGCCAA GATCGATCGC TTCATGCGGG TGATCCATCT GCAGGGAGAT CTAGATGCGG AGCAGCGCGC CAAGCTTCTG GAGATTGCCG ACAAATGCCC GGTGCATCGC ACCCTTGAAC AGAGCTCACG GGTCGAAACA CGGTTGGAAG ATGCACCAAA GCTGCCTGAT ATGTCCCGCG AAGGGCAGGA TCTCGACACC GCCTCCATTT GA
|
Protein sequence | MPTERISFAG HAGHDLAARL DLPEGPVLAT ALFAHCFTCS KDIPAARRIA ARLAAMGIAV LRFDFTGLGH SGGEFANTSF TSNVADLIAA ARYLASRNMA PDMLIGHSLG GAAVLRARAG IPSVKSVVTL GAPFDPGHVA HHFEDALEEI NRTGRAEVNL GGRPFVIGKE FVDDIGQTEL GEAISDLRAA LLVMHAPRDA TVSIDNAAEI FGAARHPKSF VTLDDADHLI TDPCDAEYAA DMIATWATRY VDMKPPAPPP GAPEGVIRVT EADPQGFLQD VQNGPYHHLL ADEPEAYGGT NRGLSPYGFV AAGLGACTSM TLRMYARRKD WLLEGISVEV CHDKVHAQDA IPSGPAKIDR FMRVIHLQGD LDAEQRAKLL EIADKCPVHR TLEQSSRVET RLEDAPKLPD MSREGQDLDT ASI
|
| |