Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ajs_1842 |
Symbol | |
ID | 4671472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax sp. JS42 |
Kingdom | Bacteria |
Replicon accession | NC_008782 |
Strand | - |
Start bp | 1922640 |
End bp | 1925678 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639838930 |
Product | transposase Tn3 family protein |
Protein accession | YP_986105 |
Protein GI | 121594209 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.550818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGCA TTGAACGGAC TGCGTACCCT CGATTTCCTC GAATATTGAC GCTCAAAGAC CTGCAAGCAT CGTTCACTCC GAAGCCCGAG GAGATTGAAT GGGCGCAGCA ACAGTCACGC ACTCCGCAAC GTCGGCTGGC ATTGTTGGTG CTGCTCAAAT GCTTTGAGTT CCTGCGACAC TTCCCGGCGC TCGATGCGAT TCCCGCCGAG GTTGTCGAGA ATATCTCAGC AACGCTGGGC GTCGCTCCGA CTGACAAGAT CGAATTCGCT TCGCTCGCCA CGCTGTATCG ACACCACAAA GTGATTCGAG AACTGCTTGG TGTGAAGCCC TACACAGATG GGCAAACACG AAGGCTGGCT ATCCAGATAG CCCAACAGGT CGCCAATGTC GTGGAGACCC GTGTTGACAT CATCAACATC ACCATCGAAG AGTTGGTGCG GCTGGGTTAC GAGCTGCCAG TGTTCCGCAC GCTAGACGAA ATCTCCGAAC AAGCCCACTC GGCCGCGGAG GCCGCATTGA ATGCACGAAT CAACCAGCGC CTGAGTTCGG CCCAAAGAAC GTGGCTTGAC CGATTGCTGG AGTCGGAATT ACCTGCAAGG CGAACTCTGT ACAACCAGAT CAAGAAATCG GCAAAAAAAG CCTCGCGCAA GCATCTTGAT TTGCTGTTGG ACCAGTTGAC ATGGCTTGAG TCACTGCCTG ACAGCGATGC GCTGCTCGAC GGCATTCCGG TTACCAAGCT GAACTACATG GCTGGCATGG CATCGGCGCT CGATGCGGGT GACATGAAGG ACTTACTGCC CGCCAAGCGC TACACATTGA TCCTGGCATT GATCCGACAG ATGCGCGTGA GGGCTCGCGA CGATGTGGCA GAAATGTTCA TTCGCCGCAT CGGAGCCATC CACAGGACCG CAAGAGAGGA ACTGCAGGTG ATCCAGGCGC GCCAGCGCGA ACTGAGCGAA GAACTCGTCG CCACGCTGGA GCAGGTGCTG GAGATCCTCG CAGAAAACCT GGATGACGCG AGTACCGGGC AGCGCGTTCG AGATCTGCTT GCCCCGCACG GTAGCCTGGA TCAACTGACG ATGGACTGTG AGGCCATCCG TGTGTGGAGT GGCGGTAACC ATCTACCGCT GGTCTGGAAG GCATTCAGCA GTTGGCGAGC GGCGATGTTC CGGATGGCTA AGGCGTTGCG CTTTGAGGCT GCTACCCAGG ACCGCAGTCT GCTGGATGCC CTTCAGGTCG TGCTGGCCAA CGAGCATCGC AAGGTGGAGT GGATCGAGGA TGACCTGGCG CTATCGTTCG CCTCCGAACG TTGGCGCAAG CTGGTGCGTC GTTCGCATGG GCTGGGCCAT CCGACGAACC GCCGCTACCT CGAAGTGTGT GTGTTCAGCT ACTTGTGTGG TGATCTTCGC TCTGGCGATG TTTGCATTGA AGGCTCAGAG TCATTTGCCG ACTACCGCTT GCAACTGCTG CCCTGGAAGG AGTGCGAAGC GCAGTTGCCA ACCTATTGCG ACCGGATTGG AATACCAGCC ACTGCGGGTG ACTTCGTAGA CGGACTCAAG CAATTGCTCA CCGAGACGGC CAAGAAAATC GATGATGAGT TTCCCCAGCA TGCCGGAGAC GTGGTGATCG GTTCCAACGG TGAGCCGACG CTGCGGCGTG TTGCCGCGCG CGAAGTTCCG GCATCTGCCA TCGCACTGCA TGCCGCTATC GAAAATCGCA CGGCCCCTCG CAACCTGCTG GATGTTCTCG CCAACATCGA GCACTGGACG GGATTCACCC GAAACTTTGG GCCGCAGTCT GGCGACGATC CTAAACTTCG CAATGCGCGT GAGCGCTACT TGCTGACCGT TTTTGCCATG GGCTGCAACC TGGGGCCCAA TCAGGCCGCG CGCCATCTGT CCAACGGTGT GACACCGCAT CAACTGTCCT ATGCCAACCA GCGGCACATG AGCCTGGACC AGTTGGACAA CGCCTGTCGG GATCTGACCG AGCTGTATCT GCGGCTTGAG CTGCCCAAGC TGTGGGGCGA AGGCAAAAAA GTCGCGGCAG ATGGCACTCA GCACGACTTC TACGACCAGA ACCTGCTGGT GGGCATGCAT TTCCGGTACC GGCGCATGGG GGCCGTGGCC TATCGGCATG TCGCCGACAA CTACATCGCG GTTTTTCGCC ACTTCATACC ACCAGGGGTA CTGGAGGCGG TCTACGTCAT CGAGGGCCTG ATGAAGGCAG GCTTGAGCGT TCAGGCCGAT ACCGTGTACT CGGACACCCA TGGCCAGTCT GAAACGGTGT TCGCCTTCAC GCACTTGGCG GGTATCCAAC TGATGCCGCG CATCCGAAAC TGGAAGGATC TGCGTTTCTA TCGCCCCGAG AAGGGGATGC GGTTCCGGCA TATCGACCGC CTGTTCAGCG ACGTTGTGGA CTGGAAGCTC ATCCGCGACC ATTGGCGAGA TCTGATGCAG GTATCGATAT CGATCCAGGC CGGAAAAATC GCCTCACCGA TGCTGTTGCG CAAGCTCAGC CAAGAAGGGC GTCACAACCG ACTGTTTGCT GCGGCACGCG AACTTGGCCG CGTGCTGCGC ACCGTCTACT TACTGCGTTG GATTTCCAGC AAGGAAATGC GACAGGAGGT GAGCGCAACG ACCAACAAGA TCGAGTCATA CCATGCCTTC ACAAAATGGC TGGACTTCGG TGGCGACGTG ATCAATGAGA ACGATCCGAA CGAGCAGCAA AAGCGGGTGC GCTTCATCGA TCTCGTTGCC TCATCGGTGA TCCTGCAAAA CACGGTGGAC ATGATGAGGG TGTTGCAGGA AATGTACGCA GATGGGGAGC CGGTTTCTGC GGCCGACGTT GAGTACCTGA GTCCCTACAT GACCTCGGGT ATCAAGCGGT TTGGCAACTA TCACCTCGAC CTGAAGCGGC CCCCGGAGCC CTGGGTCAAG GAATCACAGT TTCGTGAGGC GGCCAAGCGT GCACGAGCTG CGGCAGCTGC AGTGGCAGCG GAGGGGCAGC GAGCCGGCCA AAAGGGAGCA GGAGCATGA
|
Protein sequence | MASIERTAYP RFPRILTLKD LQASFTPKPE EIEWAQQQSR TPQRRLALLV LLKCFEFLRH FPALDAIPAE VVENISATLG VAPTDKIEFA SLATLYRHHK VIRELLGVKP YTDGQTRRLA IQIAQQVANV VETRVDIINI TIEELVRLGY ELPVFRTLDE ISEQAHSAAE AALNARINQR LSSAQRTWLD RLLESELPAR RTLYNQIKKS AKKASRKHLD LLLDQLTWLE SLPDSDALLD GIPVTKLNYM AGMASALDAG DMKDLLPAKR YTLILALIRQ MRVRARDDVA EMFIRRIGAI HRTAREELQV IQARQRELSE ELVATLEQVL EILAENLDDA STGQRVRDLL APHGSLDQLT MDCEAIRVWS GGNHLPLVWK AFSSWRAAMF RMAKALRFEA ATQDRSLLDA LQVVLANEHR KVEWIEDDLA LSFASERWRK LVRRSHGLGH PTNRRYLEVC VFSYLCGDLR SGDVCIEGSE SFADYRLQLL PWKECEAQLP TYCDRIGIPA TAGDFVDGLK QLLTETAKKI DDEFPQHAGD VVIGSNGEPT LRRVAAREVP ASAIALHAAI ENRTAPRNLL DVLANIEHWT GFTRNFGPQS GDDPKLRNAR ERYLLTVFAM GCNLGPNQAA RHLSNGVTPH QLSYANQRHM SLDQLDNACR DLTELYLRLE LPKLWGEGKK VAADGTQHDF YDQNLLVGMH FRYRRMGAVA YRHVADNYIA VFRHFIPPGV LEAVYVIEGL MKAGLSVQAD TVYSDTHGQS ETVFAFTHLA GIQLMPRIRN WKDLRFYRPE KGMRFRHIDR LFSDVVDWKL IRDHWRDLMQ VSISIQAGKI ASPMLLRKLS QEGRHNRLFA AARELGRVLR TVYLLRWISS KEMRQEVSAT TNKIESYHAF TKWLDFGGDV INENDPNEQQ KRVRFIDLVA SSVILQNTVD MMRVLQEMYA DGEPVSAADV EYLSPYMTSG IKRFGNYHLD LKRPPEPWVK ESQFREAAKR ARAAAAAVAA EGQRAGQKGA GA
|
| |