Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3878 |
Symbol | |
ID | 5714407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | - |
Start bp | 100522 |
End bp | 101904 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641276791 |
Product | transposase IS4 family protein |
Protein accession | YP_001542087 |
Protein GI | 159046416 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.726843 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGGGC CAAGGCAGGA AGCACAGGCG GCACTGTTTT ACGAGTTTTC GCTGGAGGAG CATGTCCCGC AGGACCACCT TTTGAGATCG ATTGATCGGC ATCTCGATCT GAGCAGCATC CGGGGGCATT TGGCAGATTT CTATAGCCAC ACGGGGCGTC CATCTGTCGA TCCTGAGCTG ATGATCCGGA TGCTGTTGGT CGGATACTGT TTTGGCATCC GGTCAGAGCG GCGGCTCTGC GAAGAGGTGC ATCTGAACCT GGCATACAGA TGGTTCTGCC GCCTTGAACT GACAGACCGC ATCCCGGACC ATTCGACATT TTCCAAGAAC CGGCACGGCC GCTTCCGTGA CAGTGACCTC TTGCGTCATG TGTTCGAGGC GACTGTTGCG CGCTGCATTG AAGAGGGTTT GGTCGGCGGC CAGGGCTTTG CGGTCGATGC CAGCCTGATC AGCGCGGATG TCCAGAAGCA GAACTCGAGC AATCCCGAAG GCTGGGCGGC CCGCGAGATT GATCCCACGG ATGCGCCCCG CGCGGTGCGG GAGTATCTCG ACACTTTGGA CGATGAAGCC TTCGGTGCAG CGACAACAGC AAAACCCAAG TTCACCGCCC ATGCCGATCC GGCCAGTCAA TGGACGGCTG CGCGCAAAGG GCCTGCATTC TTTGCCTATT CTGACAACTA CCTGATCGAC ACCGATCACG GGATTATCGT TGACGTGGAC GCCAGCCGGT CGAACAAGAC CGCCGAGGTC GGTGCCATGC GGAAGATGCT CGACCGGACC GAAGACCGGT TTGGCGTGAA GCCCGATTGG ATCGCTGCTG ACACCGCCTA CGGATCGTCA GACAACCTGG TCTGGCTGGC ACTCAAGCGC CAGATCCTCC CCTTCATCCC TGTCTTTGAT AAAGGTGAAC GGACCGACGG AACCTTCTCG CGGTCCGACT TCACGTGGGA TGACGAGAAC GATCGCTACA TCTGCCCGAG TGGAAAGGAG ATGCGCCACA CATGGCGGAC CTATTCCGAT CCCGCGCGAA ATGCACCAGC TTGGAAAGCC CGCAGATATC GGACGCGGAA GTCTGATTGC ACGGGATGTG CGCTGAAGGC CAAATGCTGC CCCAACTCGG AGGTCCGTGC GATCCATCGC GAGAAATATG AGATCGTCCG AGACTTCGCC CGCCAATGCA CCGCCTCAGA GTACAATCCA ACTGCCCAGA GGCGGCGAAA GAAAGTAGAG ATGCTCTTTG CCCACCTTAA ACGCATCCTC GGCCTGGGCC GGCTCCGATT ACGTGGCCCA TGCGGCGTCC AAGACGAGTT TACCCTCGCA GCCACCGCCC AAAACCTTCG GAAACTAGCA AAACTCAAAC CCATGGTGCC GGCCACAGAA TGA
|
Protein sequence | MMGPRQEAQA ALFYEFSLEE HVPQDHLLRS IDRHLDLSSI RGHLADFYSH TGRPSVDPEL MIRMLLVGYC FGIRSERRLC EEVHLNLAYR WFCRLELTDR IPDHSTFSKN RHGRFRDSDL LRHVFEATVA RCIEEGLVGG QGFAVDASLI SADVQKQNSS NPEGWAAREI DPTDAPRAVR EYLDTLDDEA FGAATTAKPK FTAHADPASQ WTAARKGPAF FAYSDNYLID TDHGIIVDVD ASRSNKTAEV GAMRKMLDRT EDRFGVKPDW IAADTAYGSS DNLVWLALKR QILPFIPVFD KGERTDGTFS RSDFTWDDEN DRYICPSGKE MRHTWRTYSD PARNAPAWKA RRYRTRKSDC TGCALKAKCC PNSEVRAIHR EKYEIVRDFA RQCTASEYNP TAQRRRKKVE MLFAHLKRIL GLGRLRLRGP CGVQDEFTLA ATAQNLRKLA KLKPMVPATE
|
| |