Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3033 |
Symbol | |
ID | 4075738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 855 |
End bp | 1769 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004534 |
Product | excisionase/Xis, DNA-binding |
Protein accession | YP_611269 |
Protein GI | 99078011 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1910] Periplasmic molybdate-binding protein/domain |
TIGRFAM ID | [TIGR01764] DNA binding domain, excisionase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.627333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCGCCC CCCAAGCTGA CTGCGACATG ACTGATCTCG ACCTGCCAGA CCACGAATAT CTCACTGTCC CTGAACTGGC GGAGTTGCTG CGCCTGAAAG AGCGCAAGAT TTATGATCTT GCCGCCTCCG GAGAGGTCCC CTGTTCGCGC GCGACAGGGA AATTGCTGTT TCCGGCCAAT GAGATCCGGG AGTGGATCGC GCGGGCCAAA TCCGGCGGCG AGCCAGCCCC GGTACACCGC CCGCAGATTC TGCTCGGCAG CCATGATCCG CTGCTAGATT GGGCCATCCG GCAGTCGCAA TCTGGATTGG CCAGCTATGT CGACGGATCC ATGGATGGGC TCGAGCGGTT CTTGCAAGGC GAAGGGATAG CAGCCGGCCT GCATTTGCGC GACGAAAAAT CCGGCCAATG GAACGTGCCG ATCGTGGCCC GTATGGCCAC ACGGCAGAAT GCGGTGCTGA TCCATTTTGC CAGCCGCAAA CGTGGTCTTG TCTATCGCGA TCCGGACCTT GACCTTGCCT CTCTGAACGA GATTTCCAAC CTGAGATTTG CACCGCGCCA ACCCGGTTCC GGCACGGATC AGCTGTTTCG AGATCTCGCT GCTGAAGCGC GCCTTGACCT TAAGAAAGTG GATCTTGTCG ATGTGGCGCG CTCCGAAGAT GAGGCCGTGG AGAGCGTGCG CCGTGGTCTT GCGGATGTCA CCTTTGGTCT CGAGGCCGTC GCCAAAAGCT ACGGGCTAAA GTTCACCCCA CTCATCGATG AGGAGTTTGC GCTTCTGGTG GATCGAAAAG CGTGGTTTGA GCCCGCTTTC CAATGTTTTT TGACCTTTTG CCAGACTGAC GCCCTTGCGC AGCGCGCCGC CGGCATGGGT GGCTACAATG TCAGCGCCCT CGGCCGCGTG CGCTGGAACG CTTGA
|
Protein sequence | MAAPQADCDM TDLDLPDHEY LTVPELAELL RLKERKIYDL AASGEVPCSR ATGKLLFPAN EIREWIARAK SGGEPAPVHR PQILLGSHDP LLDWAIRQSQ SGLASYVDGS MDGLERFLQG EGIAAGLHLR DEKSGQWNVP IVARMATRQN AVLIHFASRK RGLVYRDPDL DLASLNEISN LRFAPRQPGS GTDQLFRDLA AEARLDLKKV DLVDVARSED EAVESVRRGL ADVTFGLEAV AKSYGLKFTP LIDEEFALLV DRKAWFEPAF QCFLTFCQTD ALAQRAAGMG GYNVSALGRV RWNA
|
| |