Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3666 |
Symbol | |
ID | 8449285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4017149 |
End bp | 4019923 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645042731 |
Product | transposase Tn3 family protein |
Protein accession | YP_003202967 |
Protein GI | 258653811 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.376312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0687479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGG CAAGCCATCT GCCTGTCTAC GAAACCCGAT ACGGTCTACA GCTCGATGAG TTCCTGCTCG CCCGCGCGTT GGAGCACGAC GCGCCCACGC TGCTGCTGCA GATGGCCTGC GACTGGCTAC GCGGGGAACA CATCGTCCGG CCGGCGGTCG ACGCGCTGGC CCGGCGGGTC GCGTCCGCGC GTGATGCCGC CCGCGCGGAG ACCTATCACC GTCTCCGGCC GTTGCTGTCC CCGCCTCGGC CGCGACAGCT CGACGGGCTG CTCGACGTCG ACCCGGATCT GGGGATCACC CGGCTGACCT GGCTGCGCCG CGGCGCGACG GCCGCGACCC CGGAGGTGCT CAAGGCCGAG ATCGACAAGC TGGAATTCCT GCGTGGGCAC GGCGCCGACA CCCTGGACCT TTCCCGGCTG CCGGCCGGCC GGCGCCGGCT GCTGGCCGAG ATCGGCCGGC GTTCCACCAA CCAAGCCCTG CAGCGTGCCG ATGTCGACCG CCGGCATCCG GTGCTGCTGG CCACGCTCGC CGAGACCTAC GTCGAGGTGC TCGACGAGCT GGTCCAACTG CTCGACCAGG CCCTGGCCGG CGCGGAGTCC CGCGCCCGGC ACGAGCTGTC GCAGCGGCTG GTCGACCGCG CGAAGGCCGA AGCCGACCGC GCCCGGCTGC TCGATGAGAT CCTCGACGTG CTGGCCGATC CGAGCGTCGC CGACGCCGCC GCCGGCCGCC TGGTGCGCCA ACGGGTCGGT ATGCCGCGTC TGGTCGCTGC GCGCCGGCCC GCCGGGGAAC GCGAGCAGCG CGACCACGGC CACTTCGATC TGCTCGCCGC CCGCTACAAA TACCTGCGCA CCTTCACCCC AGCCGTGATT GCGTCGCTGC CGCTGACCGG CAACACCGCC AGCCCGGCCG TCAGGTCCCT GCTCGACGCA GTCGCCGTGC TGCGGGAGCT GAACACGGCC GGACGAAGCA TGGTGCCCGA CGACGCGGCC ACGGCGGAGG CGACGTCATT CGTCCCGGCC CGCTGGCGCG ACTACCTGGA CGCGACCCGC GGACAGGGCC GCGGCGCGGC CTACCGGCAC TACTGGGAAC TCGCCGTGCT GTACGGGGTG CAGGCCGGGC TGCGTTCGGG TGACCTGTGG GTGCCCGGCT CACGCCGGTA CACCGACCCC GCCGCCCTGC TGTTGCCGGT CGAGCGGTGG GCCGTCCAAC GCGACGACTT CTGCACCCTC ACCGGAGCCG ACGCGAACCC GCACCGGCAA CTCGACCGAC TCGACGGCGA ACTGCACTCG GCGATCGCCT CTCTGGAGGC CGTGCTGGCC GACCCCTCGG CCGAAGGCCT GGCCCGCCTC GGCGACGACG GTGATCTGAT CGTGTCGCCG CTCGCCGCTG AGCAGGTCCC GGCCGCAGCG GACGAACTCG CCGCGGCGTG TGCCACCCGG CTACCGCGCG TGCAGCTGCC GGCACTGCTG ATCGAGGTCG ACCAGATGAC CGGGTTCAGC CAGGAGTTCA CCCATGCCGG CGGCGCCCAG CCCCGCAACC CTGATCTGCG CCGCAACCTG TACGCGGCGT TGATCACCTA CGCCTGCAAC CTCGGCTACG CCGGGATGGC CGACGCCTCC GGCATCTCCG AAGACCAACT GGCCTGGACC TCCCAGTGGT ACCTGCGGCA AGACACGCTG CGCGCAGCAA ACACCCGGCT GGTCAACGCC CACCACGCGA ATCCACTCGC TGCCCTGTGG GGCGGCGGCA CCCTGTCCTC GTCCGACGGG CAACGGTTCC CGCAGCGCGG CCGCAGCCTC ACCGCCCGCG CCCTGTCCCG GTACTTCCTC GACGAGGGCA CCACCACTTA CACCCACGTC TCCGATCAAC ACTCCACGTA CGGCACCAAG GTCATCCCGA CGACCTGGCG CGAAGCCGTC GCCGTGCTGG ACGAGATCTT CGGCAACCCC ACCGATCTGC CGCTCGGCGA GCACACCGTC GACACCGCCG GCCAAACGCT GGCGACGTTC GCGATCTTCC ACCTCGCCGG GTTGCAGTTC TCCCCACGCA TCCGCGACAT CGGCCGCCTA CAGCTCTACC GCCTCGGCGC AGCATCGACC TGGCGCGCCC GCTACCCGCA CGCCGGACCG CTGCTCGGCC AACCGATCCA GACCCAGCTG ATCGCCGAGC ACTGGAACGA CATGCTCCGC CTGGTGGGCT CGATGAAGTT CGGGCACACC ACCGCCAGCC TGCTCATCGC CAAGCTGCAC GCCAGCAGTC GGCAATCCAG CCTGGCCAGG GCGCTGCACG AGTACGGCCG GCTGATCCGC ACGATCTACG TCTGCCGTTA CGTCGCCGAC GAAGAACTCC GTCGCCGGGT GCGGCGTCAG CTGAACAAGG GCGAGAGCCT GCACGCGCTG CGCCGCGACC TGTTCTTCGC CCACCAAGGC CACGTCCGCC GACGGCACCT CGACGACCAG ATCGACCAGG CCCTGTGCCT GACCCTGGTG ACCAACGCCT GCGTGCTGTG GACCACCACC TACCTCGCCG ACGCGCTCGA TGCCCTCCGC ATGGAAGGAC ATGACGTCGA CGACGAGATC GCCGCCCACC TCACCCCGCG CAGCACGACC ACATCAACTT CTACGGCACG TATTCCTTCG ATCTCGACGC CGAACTACGC CGCGAAGGAC ACCGGCCAGG GACTGTCCAA CAAACGGTGT TTCTGGCCTG ACACGCTGGG CGTTGCTCGG CGGGGAAGGT GCCGTGATGA CCGCGACACT CAATGGTGTG CCTAG
|
Protein sequence | MTRASHLPVY ETRYGLQLDE FLLARALEHD APTLLLQMAC DWLRGEHIVR PAVDALARRV ASARDAARAE TYHRLRPLLS PPRPRQLDGL LDVDPDLGIT RLTWLRRGAT AATPEVLKAE IDKLEFLRGH GADTLDLSRL PAGRRRLLAE IGRRSTNQAL QRADVDRRHP VLLATLAETY VEVLDELVQL LDQALAGAES RARHELSQRL VDRAKAEADR ARLLDEILDV LADPSVADAA AGRLVRQRVG MPRLVAARRP AGEREQRDHG HFDLLAARYK YLRTFTPAVI ASLPLTGNTA SPAVRSLLDA VAVLRELNTA GRSMVPDDAA TAEATSFVPA RWRDYLDATR GQGRGAAYRH YWELAVLYGV QAGLRSGDLW VPGSRRYTDP AALLLPVERW AVQRDDFCTL TGADANPHRQ LDRLDGELHS AIASLEAVLA DPSAEGLARL GDDGDLIVSP LAAEQVPAAA DELAAACATR LPRVQLPALL IEVDQMTGFS QEFTHAGGAQ PRNPDLRRNL YAALITYACN LGYAGMADAS GISEDQLAWT SQWYLRQDTL RAANTRLVNA HHANPLAALW GGGTLSSSDG QRFPQRGRSL TARALSRYFL DEGTTTYTHV SDQHSTYGTK VIPTTWREAV AVLDEIFGNP TDLPLGEHTV DTAGQTLATF AIFHLAGLQF SPRIRDIGRL QLYRLGAAST WRARYPHAGP LLGQPIQTQL IAEHWNDMLR LVGSMKFGHT TASLLIAKLH ASSRQSSLAR ALHEYGRLIR TIYVCRYVAD EELRRRVRRQ LNKGESLHAL RRDLFFAHQG HVRRRHLDDQ IDQALCLTLV TNACVLWTTT YLADALDALR MEGHDVDDEI AAHLTPRSTT TSTSTARIPS ISTPNYAAKD TGQGLSNKRC FWPDTLGVAR RGRCRDDRDT QWCA
|
| |