Gene Information |
Locus tag | Gdia_1747 |
Symbol | |
ID | 6975162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 1929685 |
End bp | 1930764 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643391270 |
Product | transposase IS4 family protein |
Protein accession | YP_002276127 |
Protein GI | 209543898 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
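The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the reported gene length) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1929685
end_bp = 1930764
gene_length_bp = 1080
protein_length_aa = 359

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should consist of a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and does not encode a residue.
expected_protein_length = codons - 1

print(span_bp, codons, remainder, expected_protein_length)
```

Under these assumptions the span equals the reported 1080 bp, and 360 codons minus one stop codon gives the reported 359 aa.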
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.25501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAGC CGGGATTTTT TGACGTTGAA GAGCGGCTTG CCCGGTTAAG CGGGCTTGGC GATCAGCTCG AAGCATTTTC CCGGACTGTA GATTTTGAAG CGTTCCGTCC TGATCTGGAG AAGGCTCTGG CCTATTCAGA TGGAAGCAAA GGCGGGCGAC CGCCATTTGA TCCGTTGCTA ATGTTCAAGA TCCTGGTCAT CCAGACGCTC AACAATTTGT CTGATGAGCG CACGGAGTAT CTGATCAACG ACCGCCTGTC CTTCATGCGC TTCCTTGAGC TGGGGCTTTC AGATCGAGTT CCGGATGCCA AAACAATCTG GCTGTTCCGT GAACGCCTGA CCCAGGCGGG AGCGATCGAG GGTCTGTTCA ATCGCTTTGA TACAATGCTG CGGCACGCAG GCTATCTGCC GATGTCGGGC CAGATCCTGG ATGCCACACT GGTGGCTGCT CCAAAGCAGC GCAATACCAA CGCCGAGAAA GCCGACCTCC GGGCAGGCCG TATTCCCGAA AACTGGCAGT ACAAGCCGTC AAAGCTGTCG CACAAGGATC GTCATGCGCG CTGGACACTG AAGTTTACGA AGGCGAAGCG TCAGGATGAC GGAACAACCC CCACAACGGA TCTCGCTATC CCGTTCTTTG GCTATAAATC GCATGTTTCC ATCGATCGGA AATACCGGTT CATCCGGAAA TGGAAAACAA CGCATGCCGC CGCCAATGAT GGCGCGCGAT TGAGAGAGGG GCTGCTGGAT AAAACCAATA CGGCCTCAAA CGTCTGGGCT GACACAGCCT ATCGCTCAAA AGCCAACGAA GACTTCATGG AAAAGCAGGT CTTTGTCTCA AAGGTTCACA GGAAGAAGCC GCATCTCAAA CCCATGCCCC GCCATATCCA GCGGTCCAAT GCAGGAAAGT CCGTGATCCG GTCCCGTGTC GAGCATGTCT TTGCCGATCA GAAGTCGCAG ACGGGACTGT TCATCCGAAC TGTCGGTATC ACCCGGTCCA CCATGAGGAT CGGGCTGGCC AATATCGTCT ACAATATGCG CCGCTTTCTC TTCCTGCAGA AGATCAGCGC GAGCGCGTAG
|
Protein sequence | MKQPGFFDVE ERLARLSGLG DQLEAFSRTV DFEAFRPDLE KALAYSDGSK GGRPPFDPLL MFKILVIQTL NNLSDERTEY LINDRLSFMR FLELGLSDRV PDAKTIWLFR ERLTQAGAIE GLFNRFDTML RHAGYLPMSG QILDATLVAA PKQRNTNAEK ADLRAGRIPE NWQYKPSKLS HKDRHARWTL KFTKAKRQDD GTTPTTDLAI PFFGYKSHVS IDRKYRFIRK WKTTHAAAND GARLREGLLD KTNTASNVWA DTAYRSKANE DFMEKQVFVS KVHRKKPHLK PMPRHIQRSN AGKSVIRSRV EHVFADQKSQ TGLFIRTVGI TRSTMRIGLA NIVYNMRRFL FLQKISASA
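As a spot check on how the gene and protein sequences relate, the sketch below translates the first 30 bases and the last 15 bases of the gene sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in the amino acid assignments). The helper name `translate` is hypothetical, and this is a minimal illustration rather than a full translation tool.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases cycled as T, C, A, G).
# Translation table 11 uses the same assignments; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces in the input are ignored."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# First three 10-base blocks of the gene sequence above (30 bases, 10 codons).
first_30 = "ATGAAGCAGC CGGGATTTTT TGACGTTGAA"
# Last 15 bases of the gene sequence above (5 codons, ending in the stop codon).
last_15 = "AGCGCGAGCGCGTAG"

print(translate(first_30))  # MKQPGFFDVE
print(translate(last_15))   # SASA*
```

The first result matches the opening block of the protein sequence, and the second matches its final four residues followed by the TAG stop codon, which is not part of the 359 aa protein.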