Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2202 |
Symbol | |
ID | 4078193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2311933 |
End bp | 2313438 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638007524 |
Product | hypothetical protein |
Protein accession | YP_614196 |
Protein GI | 99082042 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.719415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGCCG CACCGCTTCA TGCGACGGAC CGCCCTGCCC CCGCGGCGCC TGGTTTGGCA GAGCCGCATC TGTCGATCTT GCCCCGAACC ACACATGAAA CCCGTCGGGT GGAACGCGTC ACCGCGCCCA CTGAGGACTT TGCTGCACCC GAAGCGTTCG AGGAAAACTC TGCCGGAACT GCGACCCTGC CGCACACAGA TGATCGCGAT GCCTTCAGCC AGCCGCCCGC AAACCTCACC TTTGAGCAAG GGTTGGAGTT TCATCTCGGG GAAGCGCTGT TTGACAAACT ATGGGTGTTT TCTCCGGCCT CGACGCTGGC GTCAGATGGG CTCGGCCCCG TCTACAATGC CCGCGCCTGC CAGCGATGTC ATATCCGCGA CGGTCGTGGT CATTTGCCAG AGGGGCCGGA CTATCGCTCT GCCTCGACGT TTTTGCGCAT ATCGATCCCG GGCCCGGTGC CACCGCACCT GCGCGCCATT CCAGACTATA TCGGCATGGT GCCAGAGCCG ACCTATGGCT GGCAGTTGCA GGATTTTTCT GCCCCCGGCG TAGATCCCGA ATACCGCCTG TCCGTCACCT TTACCGAGGA AGAAATTGCC CTGAATGGCG GCGAGGTCGC GCATCTGCGC AAACCCCAAT TCGAGGCGTC TGCGCTCAAA TACGGCCCGC TTGATTCAGA GGCTATGCTG TCACCGCGCG TCGCTCCACC GATGATCGGG CTCGGCCTTC TGGAGGCCAT CCCGGCCGCC GACATCCTCG CGCTGGCCGA CCCGGACGAC CTGGACGGCG ATGGCATCTC GGGGCGCGCC AACCTCGTAT GGTCGAAGGA ATTCGACCAG GTCATGCTTG GTCGGTTTGG CCTCAAAGCC GGGACCGCCA CCATTCACGA ACAATCCGCG GCGGCCTTCT CTGGCGATAT CGGCATCTCG ACCCCTTTGT TTCCCCAGCC CTACGGGGAC TGCACCGCTC AGCAGGTTGA TTGCCGCGCT GCCCCTCATG GCGATGAAGA CTTGCGGGGC ACTGAAATCG ACCAGCCCAA TATGGATCTC GTGACCTTTT ACAGCCGCAA TATCGCAGTA CCGGCCCGCC GCAACTTGGA TGCGCCAGAG GTGTTGCGCG GTAAATCCGT ATTCTACGAG ATCGGCTGCA CCAGCTGCCA CACCCCGAAG TTTGTCACCC ACCGCCTGAT AGACCGCCCC GAGCAGAGCT TTCAGCTCAT CTGGCCCTAC TCTGACCTGC TGCTTCACGA TATGGGTGAG GGGCTGGCCG ACAACCGCCC CGAAGCGCGC GCCACGGGGC GCGAGTGGCG CACCGCCCCG CTCTGGGGGA TCGGGCTGAC GCAACAGGTC TCTGCTCGCG CCACGTTCCT GCATGATGGC CGCGCCCGCA CCCTTCTTGA GGCGATCCTC TGGCACGGCG GCGAAGCACA AGCCGCCCGC GACCGGGTGG TGGCCCTGCC CCCTGCCGAC CGCGCGGCGC TTATTTCATT CCTGGAGTCC CTTTGA
|
Protein sequence | MLAAPLHATD RPAPAAPGLA EPHLSILPRT THETRRVERV TAPTEDFAAP EAFEENSAGT ATLPHTDDRD AFSQPPANLT FEQGLEFHLG EALFDKLWVF SPASTLASDG LGPVYNARAC QRCHIRDGRG HLPEGPDYRS ASTFLRISIP GPVPPHLRAI PDYIGMVPEP TYGWQLQDFS APGVDPEYRL SVTFTEEEIA LNGGEVAHLR KPQFEASALK YGPLDSEAML SPRVAPPMIG LGLLEAIPAA DILALADPDD LDGDGISGRA NLVWSKEFDQ VMLGRFGLKA GTATIHEQSA AAFSGDIGIS TPLFPQPYGD CTAQQVDCRA APHGDEDLRG TEIDQPNMDL VTFYSRNIAV PARRNLDAPE VLRGKSVFYE IGCTSCHTPK FVTHRLIDRP EQSFQLIWPY SDLLLHDMGE GLADNRPEAR ATGREWRTAP LWGIGLTQQV SARATFLHDG RARTLLEAIL WHGGEAQAAR DRVVALPPAD RAALISFLES L
|
| |