Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4035 |
Symbol | |
ID | 5714564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009957 |
Strand | + |
Start bp | 99093 |
End bp | 100121 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641276947 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001542243 |
Protein GI | 159046573 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.327975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTGT TTGTCGGATT GGATGTGTCG CTTGCGAAGA CTTCGGTCTG CGTGATCAGC GAGTACGGCA AGATTATCAA AGAGGCAGAG ACTGAAAGCG AACCCGAGGT TCTGGCGCGC TGGCTGCATG ATCTGGACGG CAGCATCGCG GCGATTGGCC TGGAGGCTGG GCCTCTGTCG CAATGGCTGC ACCGAGGGCT GACCGAAGCT GGCCTTGATA CGGTGCTCAT GGAAACGCGC CAAGTGAAAG GAGCGCTGAA GGCGATGCCG ATCAAGACGG ATCGGCGCGA TGCAGAAGGG ATTGCACGCC TTCTTCATCT CGGCTGGTTC CGCCCGGTCC ACTGTAAATC CGTGTCTGCT CAGGAAACCC GGGCGGTTCT TGGCGCTCGA AAGGCTATCC AGCAGAACAT GATCGCTCTG GAAATGTCGT TGCGCGGACT CCTGCGGAAC TTTGGCCTCA AGGTCGGCGC GATCTCCCGT GGCAGGTTTG AGACACGCAT TCGGGAGTTG GCAGATGGCA ACCCGATGCT GGAAACCGCG ACAGACCCGA TGCTGCGGGC CCGGGCGACC CTACGGCAGG AACTGGCCGG GCTCGAAGAA CGCGTGCGCC AGTTGGCCTG GGATGATCAG GTTTGCCAAC GGCTTATGTC GATGCCTGGA ATCGGTGCGG TCGTAGCACT TACATTCCGT GCTGCGGTCG ATGATCCTGC CCGCTTTCGG TCTTCAAAGA GAATTGGCCC CTGGGTTGGC CTGACGCCCT CACGCAACCA GTCCGGTGAA CGAGACGTGT CAGGCGGCAT CACCAAGGCT GGTGACGTCA ATCTGAGGCG AACATTGTGC CAGGCAGCAA CCGTCATGAT GAATCGCGGC CGATCGACAT GGCTGAGAAC ATGGGGAGCC CAGCTCGCGC AGCGGCGTGG TCGCAAAATC GCGATGGTCG CCCTCGCACG CCGCATCGCT GTCATCCTCC ATCGGATTTG GGTCGATGGC ACAACCTTCC AGCCAGATGC CGCGCCGAAC CTTGCCTGA
|
Protein sequence | MKLFVGLDVS LAKTSVCVIS EYGKIIKEAE TESEPEVLAR WLHDLDGSIA AIGLEAGPLS QWLHRGLTEA GLDTVLMETR QVKGALKAMP IKTDRRDAEG IARLLHLGWF RPVHCKSVSA QETRAVLGAR KAIQQNMIAL EMSLRGLLRN FGLKVGAISR GRFETRIREL ADGNPMLETA TDPMLRARAT LRQELAGLEE RVRQLAWDDQ VCQRLMSMPG IGAVVALTFR AAVDDPARFR SSKRIGPWVG LTPSRNQSGE RDVSGGITKA GDVNLRRTLC QAATVMMNRG RSTWLRTWGA QLAQRRGRKI AMVALARRIA VILHRIWVDG TTFQPDAAPN LA
|
| |