Gene Information |
Locus tag | Namu_3609 |
Symbol | |
ID | 8449228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3961308 |
End bp | 3964238 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645042679 |
Product | transposase Tn3 family protein |
Protein accession | YP_003202915 |
Protein GI | 258653759 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
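The coordinate and length fields above agree with each other, and the check can be sketched in a few lines. This is a minimal sketch under two assumptions that the record does not state: Start bp and End bp are 1-based and inclusive, and the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
start_bp, end_bp = 3961308, 3964238

length = gene_length_bp(start_bp, end_bp)
print(length)                              # 2931
print(expected_protein_length_aa(length))  # 976
```

Both results match the Gene Length (2931 bp) and Protein Length (976 aa) fields.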
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0836864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0424125 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCAGCG ATGACCTGTC CTGGGTCGCG GGCTTCAACC GGGTCGAGAA CCGGTTGGGT GTGATGGTGC AGCTGTGCGC CTTGCCGTGG CTGGGCTGGA TCCCGGACGA CCTGAGCGCG TGCCCGGCAG CCGCCCTGGA CCGCTTGGCC GCCGCATTGG CGGTCGCCCC TGATCAGGCG GCCGGCTTGC TGGTGGCCTA TGGCGGCTGG CGGGGAGAGA CCCGACGCAC CCACCGGGCC CAGGTGCTGG CCCGGCTCGG CTGGCGCTGG TGTGCCGCCG GCGAGCGCAA GCAGCTCGAT GAATTCCTGC TCGCCCGCGC GTTGGAGCAC GACGCGCCCA CGCTGCTGCT GCAGATGGCC TGCGACTGGC TACGCGGGGA ACACATCGTC CGGCCGGCGG CCGACGCGCT GACGAGGCGG ATCGCGTCCG CGCGTGATGC CGCCCGCGCG GAGACCTATC ACCGGCTCCG ACCGTTGCTG TCCCCGCCTC GGCCACGCCA GCTCGACGGG CTGCTCGACG TCGACCCGGA TCTGGGGATC ACCCGGCTGA CCTGGCTGCG CCGCGGCGCG ACGGCCGCGA CCCCGGAGGT GCTCAAGGCC GAGATCGACA AGCTGGAATT CCTGCGTGGG CACGGCGCCG ACACCCTGGA CCTGTCCCGG CTGCCGGCCG GCCGGCGCCG GCTGCTGGCC GAGATCGGCC GGCGTTCCAC CAACCAAGCC CTGCAGCGTG CCGATGTCGA CCGCCGGCAT CCGGTGCTGC TGGCCACGCT CGCCGAGACC TACGTCGAGG TGCTCGACGA GCTGGTCCAA CTGCTCGACC AGGCCCTGGC CGGCGCGGAG TCCCGCGCCC GGCACGAGCT GTCGCAGCGG CTGGTCGACC GCGCGAAGGC CGAAGCCGAC CGCGCCCGGC TGCTCGATGA GATCCTCGAC GTGCTGGCCG ATCCGAGCGT CGCCGACGCC GCCGCCGGCC GCCTGGTGCG CCAACGGGTC GGTATGCCGC GTCTGGTCGC TGCGCGCCGG CCCGCCGGGG AACGCGAGCA GCGCGACCAC GGCCACTTCG ATCTGCTCGC CGCCCGCTAC AAATACCTGC GCACCTTCAC CCCAGCCGTG ATTGCGTCGC TGCCGCTGAC CGGCAACACC GCCAGCCCGG CCGTCAGGTC CCTGCTCGAC GCAGTCGCCG TGCTGCGGGA GCTGAACACG GCCGGACGAA GCATGGTGCC CGACGACGCG GCCACGGCGG AGGCGACGTC ATTCGTCCCG GCCCGCTGGC GCGACTACCT GGACGCGACC CGCGGACAGG GCCGCGGCGC GGCCTACCGG CACTACTGGG AACTCGCCGT GCTGTACGGG GTGCAGGCCG GGCTGCGTTC GGGTGACCTG TGGGTGCCCG GCTCACGCCG GTACACCGAC CCCGCCGCCC TGCTGTTGCC GGTCGAGCGG TGGGCCGTCC AACGCGACGA CTTCTGCACC CTCACCGGAG CCGACGCGAA CCCGCACCGG CAACTCGACC GACTCGACGG CGAACTGCAC TCGGCGATCG CCTCTCTGGA GGCCGTGCTG GCCGACCCCT CGGCCGAAGG CCTGGCCCGC CTCGGCGACG ACGGTGATCT GATCGTGTCG CCGCTCGCCG CTGAGCAGGT CCCGGCCGCA GCGGACGAAC TCGCCGCGGC GTGTGCCACC CGGCTACCGC GCGTGCAGCT GCCGGCACTG CTGATCGAGG TCGACCAGAT GACCGGGTTC AGCCAGGAGT TCACCCATGC CGGCGGCGCC CAGCCCCGCA ACCCTGATCT GCGCCGCAAC 
CTGTACGCGG CGTTGATCAC CTACGCCTGC AACCTCGGCT ACGCCGGGAT GGCCGACGCC TCCGGCATCT CCGAAGACCA ACTGGCCTGG ACCTCCCAGT GGTACCTGCG GCAAGACACG CTGCGCGCAG CAAACACCCG GCTGGTCAAC GCCCACCACG CGAATCCACT CGCTGCCCTG TGGGGCGGCG GCACCCTGTC CTCGTCCGAC GGGCAACGGT TCCCGCAGCG CGGCCGCAGC CTCACCGCCC GCGCCCTGTC CCGGTACTTC CTCGACGAGG GCACCACCAC TTACACCCAC GTCTCCGATC AACACTCCAC GTACGGCACC AAGGTCATCC CGACGACCTG GCGCGAAGCC GTCGCCGTGC TGGACGAGAT CTTCGGCAAC CCCACCGATC TGCCGCTCGG CGAGCACACC GTCGACACCG CCGGCCAAAC GCTGGCGACG TTCGCGATCT TCCACCTCGC CGGGTTGCAG TTCTCCCCAC GCATCCGCGA CATCGGCCGC CTACAGCTCT ACCGCCTCGG CGCAGCATCG ACCTGGCGCG CCCGCTACCC GCACGCCGGA CCGCTGCTCG GCCAACCGAT CCAGACCCAG CTGATCGCCG AGCACTGGAA CGACATGCTC CGCCTGGTGG GCTCGATGAA GTTCGGGCAC ACCACCGCCA GCCTGCTCAT CGCCAAGCTG CACGCCAGCA GTCGGCAATC CAGCCTGGCC AGGGCGCTGC ACGAGTACGG CCGGCTGATC CGCACGATCT ACGTCTGCCG TTACGTCGCC GACGAAGAAC TCCGTCGCCG GGTGCGGCGT CAGCTGAACA AGGGCGAGAG CCTGCACGCG CTGCGCCGCG ACCTGTTCTT CGCCCACCAA GGCCACGTCC GCCGACGGCA CCTCGACGAC CAGATCGACC AGGCCCTGTG CCTGACCCTG GTGACCAACG CCTGCGTGCT GTGGACCACC ACCTACCTCG CCGACGCGCT CGATGCCCTC CGCATGGAAG GACATGACGT CGACGACGAG ATCGCCGCCC ACCTCACCCC GCGCAGCACG ACCACATCAA CTTCTACGGC ACGTATTCCT TCGATCTCGA CGCCGAACTA CGCCGCGAAG GACACCGGCC ACTCCACGTG A
|
Protein sequence | MSSDDLSWVA GFNRVENRLG VMVQLCALPW LGWIPDDLSA CPAAALDRLA AALAVAPDQA AGLLVAYGGW RGETRRTHRA QVLARLGWRW CAAGERKQLD EFLLARALEH DAPTLLLQMA CDWLRGEHIV RPAADALTRR IASARDAARA ETYHRLRPLL SPPRPRQLDG LLDVDPDLGI TRLTWLRRGA TAATPEVLKA EIDKLEFLRG HGADTLDLSR LPAGRRRLLA EIGRRSTNQA LQRADVDRRH PVLLATLAET YVEVLDELVQ LLDQALAGAE SRARHELSQR LVDRAKAEAD RARLLDEILD VLADPSVADA AAGRLVRQRV GMPRLVAARR PAGEREQRDH GHFDLLAARY KYLRTFTPAV IASLPLTGNT ASPAVRSLLD AVAVLRELNT AGRSMVPDDA ATAEATSFVP ARWRDYLDAT RGQGRGAAYR HYWELAVLYG VQAGLRSGDL WVPGSRRYTD PAALLLPVER WAVQRDDFCT LTGADANPHR QLDRLDGELH SAIASLEAVL ADPSAEGLAR LGDDGDLIVS PLAAEQVPAA ADELAAACAT RLPRVQLPAL LIEVDQMTGF SQEFTHAGGA QPRNPDLRRN LYAALITYAC NLGYAGMADA SGISEDQLAW TSQWYLRQDT LRAANTRLVN AHHANPLAAL WGGGTLSSSD GQRFPQRGRS LTARALSRYF LDEGTTTYTH VSDQHSTYGT KVIPTTWREA VAVLDEIFGN PTDLPLGEHT VDTAGQTLAT FAIFHLAGLQ FSPRIRDIGR LQLYRLGAAS TWRARYPHAG PLLGQPIQTQ LIAEHWNDML RLVGSMKFGH TTASLLIAKL HASSRQSSLA RALHEYGRLI RTIYVCRYVA DEELRRRVRR QLNKGESLHA LRRDLFFAHQ GHVRRRHLDD QIDQALCLTL VTNACVLWTT TYLADALDAL RMEGHDVDDE IAAHLTPRST TTSTSTARIP SISTPNYAAK DTGHST
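The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It assumes that translation table 11 uses the standard codon assignments and that the first codon is an initiator read as Met (which is how the TTG at the start of this gene corresponds to the leading M). The helper names are hypothetical and no external library is used.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is read as Met.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "TTGTCCAGCG ATGACCTGTC CTGGGTCGCG"
print(translate(first_blocks))  # MSSDDLSWVA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.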