Gene Namu_3609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3609 
Symbol 
ID8449228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3961308 
End bp3964238 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content71% 
IMG OID645042679 
Producttransposase Tn3 family protein 
Protein accessionYP_003202915 
Protein GI258653759 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0836864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0424125 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCAGCG ATGACCTGTC CTGGGTCGCG GGCTTCAACC GGGTCGAGAA CCGGTTGGGT 
GTGATGGTGC AGCTGTGCGC CTTGCCGTGG CTGGGCTGGA TCCCGGACGA CCTGAGCGCG
TGCCCGGCAG CCGCCCTGGA CCGCTTGGCC GCCGCATTGG CGGTCGCCCC TGATCAGGCG
GCCGGCTTGC TGGTGGCCTA TGGCGGCTGG CGGGGAGAGA CCCGACGCAC CCACCGGGCC
CAGGTGCTGG CCCGGCTCGG CTGGCGCTGG TGTGCCGCCG GCGAGCGCAA GCAGCTCGAT
GAATTCCTGC TCGCCCGCGC GTTGGAGCAC GACGCGCCCA CGCTGCTGCT GCAGATGGCC
TGCGACTGGC TACGCGGGGA ACACATCGTC CGGCCGGCGG CCGACGCGCT GACGAGGCGG
ATCGCGTCCG CGCGTGATGC CGCCCGCGCG GAGACCTATC ACCGGCTCCG ACCGTTGCTG
TCCCCGCCTC GGCCACGCCA GCTCGACGGG CTGCTCGACG TCGACCCGGA TCTGGGGATC
ACCCGGCTGA CCTGGCTGCG CCGCGGCGCG ACGGCCGCGA CCCCGGAGGT GCTCAAGGCC
GAGATCGACA AGCTGGAATT CCTGCGTGGG CACGGCGCCG ACACCCTGGA CCTGTCCCGG
CTGCCGGCCG GCCGGCGCCG GCTGCTGGCC GAGATCGGCC GGCGTTCCAC CAACCAAGCC
CTGCAGCGTG CCGATGTCGA CCGCCGGCAT CCGGTGCTGC TGGCCACGCT CGCCGAGACC
TACGTCGAGG TGCTCGACGA GCTGGTCCAA CTGCTCGACC AGGCCCTGGC CGGCGCGGAG
TCCCGCGCCC GGCACGAGCT GTCGCAGCGG CTGGTCGACC GCGCGAAGGC CGAAGCCGAC
CGCGCCCGGC TGCTCGATGA GATCCTCGAC GTGCTGGCCG ATCCGAGCGT CGCCGACGCC
GCCGCCGGCC GCCTGGTGCG CCAACGGGTC GGTATGCCGC GTCTGGTCGC TGCGCGCCGG
CCCGCCGGGG AACGCGAGCA GCGCGACCAC GGCCACTTCG ATCTGCTCGC CGCCCGCTAC
AAATACCTGC GCACCTTCAC CCCAGCCGTG ATTGCGTCGC TGCCGCTGAC CGGCAACACC
GCCAGCCCGG CCGTCAGGTC CCTGCTCGAC GCAGTCGCCG TGCTGCGGGA GCTGAACACG
GCCGGACGAA GCATGGTGCC CGACGACGCG GCCACGGCGG AGGCGACGTC ATTCGTCCCG
GCCCGCTGGC GCGACTACCT GGACGCGACC CGCGGACAGG GCCGCGGCGC GGCCTACCGG
CACTACTGGG AACTCGCCGT GCTGTACGGG GTGCAGGCCG GGCTGCGTTC GGGTGACCTG
TGGGTGCCCG GCTCACGCCG GTACACCGAC CCCGCCGCCC TGCTGTTGCC GGTCGAGCGG
TGGGCCGTCC AACGCGACGA CTTCTGCACC CTCACCGGAG CCGACGCGAA CCCGCACCGG
CAACTCGACC GACTCGACGG CGAACTGCAC TCGGCGATCG CCTCTCTGGA GGCCGTGCTG
GCCGACCCCT CGGCCGAAGG CCTGGCCCGC CTCGGCGACG ACGGTGATCT GATCGTGTCG
CCGCTCGCCG CTGAGCAGGT CCCGGCCGCA GCGGACGAAC TCGCCGCGGC GTGTGCCACC
CGGCTACCGC GCGTGCAGCT GCCGGCACTG CTGATCGAGG TCGACCAGAT GACCGGGTTC
AGCCAGGAGT TCACCCATGC CGGCGGCGCC CAGCCCCGCA ACCCTGATCT GCGCCGCAAC
CTGTACGCGG CGTTGATCAC CTACGCCTGC AACCTCGGCT ACGCCGGGAT GGCCGACGCC
TCCGGCATCT CCGAAGACCA ACTGGCCTGG ACCTCCCAGT GGTACCTGCG GCAAGACACG
CTGCGCGCAG CAAACACCCG GCTGGTCAAC GCCCACCACG CGAATCCACT CGCTGCCCTG
TGGGGCGGCG GCACCCTGTC CTCGTCCGAC GGGCAACGGT TCCCGCAGCG CGGCCGCAGC
CTCACCGCCC GCGCCCTGTC CCGGTACTTC CTCGACGAGG GCACCACCAC TTACACCCAC
GTCTCCGATC AACACTCCAC GTACGGCACC AAGGTCATCC CGACGACCTG GCGCGAAGCC
GTCGCCGTGC TGGACGAGAT CTTCGGCAAC CCCACCGATC TGCCGCTCGG CGAGCACACC
GTCGACACCG CCGGCCAAAC GCTGGCGACG TTCGCGATCT TCCACCTCGC CGGGTTGCAG
TTCTCCCCAC GCATCCGCGA CATCGGCCGC CTACAGCTCT ACCGCCTCGG CGCAGCATCG
ACCTGGCGCG CCCGCTACCC GCACGCCGGA CCGCTGCTCG GCCAACCGAT CCAGACCCAG
CTGATCGCCG AGCACTGGAA CGACATGCTC CGCCTGGTGG GCTCGATGAA GTTCGGGCAC
ACCACCGCCA GCCTGCTCAT CGCCAAGCTG CACGCCAGCA GTCGGCAATC CAGCCTGGCC
AGGGCGCTGC ACGAGTACGG CCGGCTGATC CGCACGATCT ACGTCTGCCG TTACGTCGCC
GACGAAGAAC TCCGTCGCCG GGTGCGGCGT CAGCTGAACA AGGGCGAGAG CCTGCACGCG
CTGCGCCGCG ACCTGTTCTT CGCCCACCAA GGCCACGTCC GCCGACGGCA CCTCGACGAC
CAGATCGACC AGGCCCTGTG CCTGACCCTG GTGACCAACG CCTGCGTGCT GTGGACCACC
ACCTACCTCG CCGACGCGCT CGATGCCCTC CGCATGGAAG GACATGACGT CGACGACGAG
ATCGCCGCCC ACCTCACCCC GCGCAGCACG ACCACATCAA CTTCTACGGC ACGTATTCCT
TCGATCTCGA CGCCGAACTA CGCCGCGAAG GACACCGGCC ACTCCACGTG A
 
Protein sequence
MSSDDLSWVA GFNRVENRLG VMVQLCALPW LGWIPDDLSA CPAAALDRLA AALAVAPDQA 
AGLLVAYGGW RGETRRTHRA QVLARLGWRW CAAGERKQLD EFLLARALEH DAPTLLLQMA
CDWLRGEHIV RPAADALTRR IASARDAARA ETYHRLRPLL SPPRPRQLDG LLDVDPDLGI
TRLTWLRRGA TAATPEVLKA EIDKLEFLRG HGADTLDLSR LPAGRRRLLA EIGRRSTNQA
LQRADVDRRH PVLLATLAET YVEVLDELVQ LLDQALAGAE SRARHELSQR LVDRAKAEAD
RARLLDEILD VLADPSVADA AAGRLVRQRV GMPRLVAARR PAGEREQRDH GHFDLLAARY
KYLRTFTPAV IASLPLTGNT ASPAVRSLLD AVAVLRELNT AGRSMVPDDA ATAEATSFVP
ARWRDYLDAT RGQGRGAAYR HYWELAVLYG VQAGLRSGDL WVPGSRRYTD PAALLLPVER
WAVQRDDFCT LTGADANPHR QLDRLDGELH SAIASLEAVL ADPSAEGLAR LGDDGDLIVS
PLAAEQVPAA ADELAAACAT RLPRVQLPAL LIEVDQMTGF SQEFTHAGGA QPRNPDLRRN
LYAALITYAC NLGYAGMADA SGISEDQLAW TSQWYLRQDT LRAANTRLVN AHHANPLAAL
WGGGTLSSSD GQRFPQRGRS LTARALSRYF LDEGTTTYTH VSDQHSTYGT KVIPTTWREA
VAVLDEIFGN PTDLPLGEHT VDTAGQTLAT FAIFHLAGLQ FSPRIRDIGR LQLYRLGAAS
TWRARYPHAG PLLGQPIQTQ LIAEHWNDML RLVGSMKFGH TTASLLIAKL HASSRQSSLA
RALHEYGRLI RTIYVCRYVA DEELRRRVRR QLNKGESLHA LRRDLFFAHQ GHVRRRHLDD
QIDQALCLTL VTNACVLWTT TYLADALDAL RMEGHDVDDE IAAHLTPRST TTSTSTARIP
SISTPNYAAK DTGHST