Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_4289 |
Symbol | |
ID | 4443540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008538 |
Strand | - |
Start bp | 22253 |
End bp | 25219 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639687610 |
Product | transposase Tn3 family protein |
Protein accession | YP_829307 |
Protein GI | 116662253 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCTCTGC GTTCTTTGCT CACGGCTGCC GAGCGCGGCC AAATCCTGGC AATGCCCGCT GAAACAGAGG ACCTTGCAGC CCACTACACG CTCAGTGATG CGGATATGTC GTTGATCCGG CAACGCCGCG GGGACGCGAA CAGGCTGGGC TTTGCGGTCC AGCTGTGTCT GCTGCGCCAC CCGGGCATCG GGCTGGCCGA CGACACCGAC GTGCCGCCGG AACTCATTGC CTGGCTGGCC TCCAGCCTGG GCGTTTCCAT TGATGCCTGG GACGAGTATG GAACACGTGA GGAAACGCGG CAGGAGCATG GACGGGAGAT CCGCGGGTAT CTGGGCATGT CAGCGTTCGG CATCGCGGAC TACCGCTGGC TCGTGGAGCA TGTGGGTGTT GTGGCTGCGC ACACGGACAA GGGCCTGGTC CTGGTCGAAA GCGCGAGGGA TTTCCTGCAG GCAAGGAAGG TCGCGTTGCC CGGGATCGGG GTCATTGAAA AGGCCTGTGC GCAGGCGCTG ACCAGGGCCA ATAGGCGGAT TTACCTCACT TTGGGTGAGC AGCTGACCGT GGGTCACCGG CAACGCCTCG ACGGGCTGCT GCGCCGCCGC CGCGATAGTT CTCTGACGGA AATCGGTTGG CTGCGGCAGG CGCCGCTGCG ACCCAACGCG CGGGCGATGA ATGAGCACAT CGACCGGCTC ACCACCTGGC GTGCGCTGGA GCTGCCCTGG GCTGCGGGAC GGCTGGTTCA TCGGAACAGG TTGCTGAAGC TTGCCCGGGA GGGTGCCTCG ATGACCGCGG CGGATCTGGC CAGATTTGAA CCGGCACGCC GGTACGCGAC CCTCTTCGCC ATGGCCACCG AAAGCATGGC CACCGTCACC GACGAAATCA TCGATCTGCA CGACCGGATC ATCGGCCGGC TCATCCGGAC CGCACAGAAC AAGCAAAACC AGGCCACCCT GGCATCCCGC TCCACCGTCG CCGCCATGAT GCGCATCCAT TCCAGGCTCG GTGATGCCCT CTTTGAGGCC AAGGAAAACG GCGAAGATCC CTTCGCCGCC ATCGAAACGG CCATCGGCTG GGAATCCCTG GCCGAAAGCA TCGCCCACGC CAAGGAACTG ACCCGCCCGG CCCTCGAGGA CCCCCTCGCC CTCGTCAGCG CCCACTTCAC CACCCTGCGC CGCTACACCC CGGCATTTCT TGCCGTCCTT GACCTCAACG CGGCCCCGGC GGCACAGGAC CTGCTGGCAG CAATCAACCT CGTCCGCACC CTGAACACCG CCGGAGCCCG AAAAATCCCC GACGATGCGC CCACCTCGTT CGTCCGGCCC CGGTGGAAGC CGCTGGTCTT CACGGAAAAC GGCATAGACC GGGGCTTCTA CGAGTTCTGT GCCCTCGCGG AACTAAAGAA TGCGCTTCGT TCCGGAGACT TGTGGGTCAC CGGATCCCGT CAATTCCGTG ACTTCGATGA CTACCTTCTT GCCGGTCCTG ACTACACGGT TATGAAAACC ACCGGGAAGC TGCCTCTGGT CACGACCGAC GGCGGCGAAA GCTATCTCCA GAACCGGCTG GCCCTGCTCA ACGAACGGCT GCACCATGTC AACGACCTCG CCTCCCGCGA TGAACTGCCC GGGGTGATGG TCACGGACAA GGGCGTGAAA ATCACCCCGC TGGAGACAAT CGTGCCAAAA CACGCGCAGC CACTGATCGA TCAGGCAAGC GCAATGTTTC CGCGGATCCG GATCACCGAT CTGTTGATGG AAGTTGATGG CTGGACCGGG TTCACCCGCC ATTTCACCAG CCTGAAATCC GGCCAGCCCT CCAAAGACAA GCAACTTCTT CTCACCGCCA TCCTCGCGGA CGGAATCAAC CTGGGCCTGA CGAAGATGGC CGAGTCATGC ACCGGCGTCA GCTACGCCCA GCTGGACCGC CACCAGGCCT CCTACATCCG GGACGAAACC TACAGCGCCG CTCTGGCGGA ACTGGTGAAC ACCCAGCACG GACACCCCTT CGCCGCACAG TGGGGCGACG GGACCACCTC CTCATCGGAC GGGCAGCGGT TCCGTGCCAG CAGCAAAGCC GAATCCACCG GGCATGTGAA CCCCAAGTAC GGTGCCGAGC CCGGCCGGCT GATCTACACA CACATCTCGG ACCAGTACTC GCCCTTCCAC AGCAAGCTCG TCAACGTCGG CGACCGCGAC GCGACCTACG TCCTGGACGG GCTGCTCTAC CACGAGTCCG ACCTGGCGAT CCAGGAGCAC TACACGGATA CGGCCGGATT CACCGATCAC CTCTTCGCTC TTATGCACCT GCTCGGGTAC CGGTTCGCCC CACGGATCCG CAACATCGGC GACACCCGTC TCTACACACC TACCACCGAT CCGGGACTTG CCACGTTGGC GCCGCTGATC GGCGGGACCA TCAACACGAA AATGATTGCC CTGCATTGGG ATGAAATCCT CCGCCTCGCC GCGTCCATCA AGACCGGCAC CGTGACCGCG TCCCTGATGA TGCGAAAACT CGGCGCCTAC CCGCGCCAGA ACGGGCTCGC ACTCGCGCTG CGGGAGCTGG GCAGACTGGA GCGGACCCTC TTCCTGCTGG ACTGGCTCCA GAACCCCGGC CTGCGCCGCA AAGTCACGGC CGGCCTGAAC AAGGGCGAGG CCCGGAACAC CCTCGCCCGG GCCGTCTTCT TCAACCGCCT CGGCGAAATC CGCGACCGCT CCTTCGAACA GCAACGCTAC CGCGCCAGCG GACTGAACCT TCTCACCGCG GCCATTATTC TCTGGAACAC CGTCTACCTC GACCGCACCA TCACCACCCT CAATAAGGAC GGGAACGCCA CGGACCCTGA CCTGCTGCGG TTCCTCTCAC CCCTGGGCTG GGAACACATC AACCTCACCG GCGACTACAC CTGGCCCCGC GCCAACCAGA TCAAACCCGG CAAATACAGG CCACTACGCC GCCCGGCAAA ACCTTAA
|
Protein sequence | MALRSLLTAA ERGQILAMPA ETEDLAAHYT LSDADMSLIR QRRGDANRLG FAVQLCLLRH PGIGLADDTD VPPELIAWLA SSLGVSIDAW DEYGTREETR QEHGREIRGY LGMSAFGIAD YRWLVEHVGV VAAHTDKGLV LVESARDFLQ ARKVALPGIG VIEKACAQAL TRANRRIYLT LGEQLTVGHR QRLDGLLRRR RDSSLTEIGW LRQAPLRPNA RAMNEHIDRL TTWRALELPW AAGRLVHRNR LLKLAREGAS MTAADLARFE PARRYATLFA MATESMATVT DEIIDLHDRI IGRLIRTAQN KQNQATLASR STVAAMMRIH SRLGDALFEA KENGEDPFAA IETAIGWESL AESIAHAKEL TRPALEDPLA LVSAHFTTLR RYTPAFLAVL DLNAAPAAQD LLAAINLVRT LNTAGARKIP DDAPTSFVRP RWKPLVFTEN GIDRGFYEFC ALAELKNALR SGDLWVTGSR QFRDFDDYLL AGPDYTVMKT TGKLPLVTTD GGESYLQNRL ALLNERLHHV NDLASRDELP GVMVTDKGVK ITPLETIVPK HAQPLIDQAS AMFPRIRITD LLMEVDGWTG FTRHFTSLKS GQPSKDKQLL LTAILADGIN LGLTKMAESC TGVSYAQLDR HQASYIRDET YSAALAELVN TQHGHPFAAQ WGDGTTSSSD GQRFRASSKA ESTGHVNPKY GAEPGRLIYT HISDQYSPFH SKLVNVGDRD ATYVLDGLLY HESDLAIQEH YTDTAGFTDH LFALMHLLGY RFAPRIRNIG DTRLYTPTTD PGLATLAPLI GGTINTKMIA LHWDEILRLA ASIKTGTVTA SLMMRKLGAY PRQNGLALAL RELGRLERTL FLLDWLQNPG LRRKVTAGLN KGEARNTLAR AVFFNRLGEI RDRSFEQQRY RASGLNLLTA AIILWNTVYL DRTITTLNKD GNATDPDLLR FLSPLGWEHI NLTGDYTWPR ANQIKPGKYR PLRRPAKP
|
| |