Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2100 |
Symbol | |
ID | 5713096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2223872 |
End bp | 2225461 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641268023 |
Product | putative integrase |
Protein accession | YP_001533438 |
Protein GI | 159044644 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTGT CAGCCTATCT AAGCCCTTCC AGACACGGTA TATTCTACTT TCGTTGGCCT CTGCCTCACT CCGGGCGGGA GAAGCGAAGC ACGGTTAAGT TATCGTTAAG AACAAGATGC CCACATCAAG CGGCAGACTT GGCACGATAC TTGGCTGCAC ATGGAAGATT AGTGAGGGAT AGCAACGCGT TGGCGGGACT GAGGCAGAGC GAGATACGCG AGAAGGTCCA GACTTACTTC AAGGCGCAAC TGGACCAGTA CCTTGACTGG CTGGACCGGC GCGGCCTGTC GAAGAACGCC CTTACCGATG CCCGCGAAGA AATGCTGGAC CACGAAAGCC TGCTAGAGTT GGAAACAACC TCCCCGATGT GGTTGCCCAT CTCCCGGTTC AAGCGCGTCA TGGACGTTTC AAACGAACAA TGGGACGCCA GCCAGCCCCG GATCACGTTC CAGCTTCGCC AAGGTCGCCG GGACATGCTC AAGCGCGTTC TGGACGCCGC TGAGGGCCTT GAGCACTATT CCTATGACGA TGCCCCGGCC ATCGCTTCCG CCCCTCTAGA GGCGCTCCCA GCCGCTCCCG GTGTCACACT TGAGAAAGCC ATCGCGGATT ACATGGACGA ACACGCTCAC CGCTGGGACG AAAAGTATAC CGACCAGATA TGGGCGTTCT TGAACATCCT TGTCGAATAT TGCGGGGCAG ATCGCCAGCT TGCGTCGATC ACCAAGCAGG ACGCGCAGGA AATCAAGAAG GTTGTCAGAG CATTGCCGCA GAATCGGCAT GTCAAACCCG AACTTAAAGG CCTCACTTTA CAGGAGGTTG TACAGGTCGA GGGGCACCCC AAGATTTCGA TTACAACCGT GAACAATCAC ATCTCAAACT TCATTCGCTT CTTCAAGTGG GCGAAGAACA ACGACTACAC GCCCCACGCT CTGTTTGAAG GCATGAAGGT GGCTAAAGCG AAAGCGAGCA AAACAGAGCG GAAGCCCTTT AGTCTCGCAC AAACTCAACT CATGTATCAC GAGTTGACTG AAAACACGTC TGGTCTCGTG CGCAAGCAAA GCCACAAATG GGGCACTCTG CTTGGCATGT TCACTGGGGC GCGTCTCAAC GAGATATGCC AGCTATTGAT TTCTGACATT CAGCAGGAAG GCGGTACTTG GTTCTTGAAC ATTGATGACG AAGGTGACGA GCGAAAGCGT GTGAAGTCAA AGGCCAGCAA GCGGAAAGTT CCCATACACT CAGAACTACT TCGCATAGGC TTCTTGGAGT TTGTAGAAAG TCGCTCACGG GATGACCGGT TGTTTCAAGA CTTCGAGTAT CATCGTAACG GCGGTTACGG GCGCAGCCTC AGCCGTTGGT TCAACGAAAA CACTTTTCTC CCCAAGCTGG GGATCAAAAG CCGAGAGTTG GTTTTTCACA GCTTTCGGCA CACGATGGTT ACGCGGCTCA GTCAAGCAAA TGTACCCAAC CCAATCGTGC AGTGCGTCGT TGGCCATGAG CGCGCAGGGG TCACACAGGA CGTTTACTTC GCTGAAGGCT ACACGCTGCC TCAGCTAAAG GATGCAGTTG AAAGGTTCAG TTGGCAGTGA
|
Protein sequence | MKLSAYLSPS RHGIFYFRWP LPHSGREKRS TVKLSLRTRC PHQAADLARY LAAHGRLVRD SNALAGLRQS EIREKVQTYF KAQLDQYLDW LDRRGLSKNA LTDAREEMLD HESLLELETT SPMWLPISRF KRVMDVSNEQ WDASQPRITF QLRQGRRDML KRVLDAAEGL EHYSYDDAPA IASAPLEALP AAPGVTLEKA IADYMDEHAH RWDEKYTDQI WAFLNILVEY CGADRQLASI TKQDAQEIKK VVRALPQNRH VKPELKGLTL QEVVQVEGHP KISITTVNNH ISNFIRFFKW AKNNDYTPHA LFEGMKVAKA KASKTERKPF SLAQTQLMYH ELTENTSGLV RKQSHKWGTL LGMFTGARLN EICQLLISDI QQEGGTWFLN IDDEGDERKR VKSKASKRKV PIHSELLRIG FLEFVESRSR DDRLFQDFEY HRNGGYGRSL SRWFNENTFL PKLGIKSREL VFHSFRHTMV TRLSQANVPN PIVQCVVGHE RAGVTQDVYF AEGYTLPQLK DAVERFSWQ
|
| |