Gene Namu_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3040 
Symbol 
ID8448653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3338601 
End bp3339866 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content69% 
IMG OID645042124 
ProductIntegrase catalytic region 
Protein accessionYP_003202366 
Protein GI258653210 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000772165 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00538259 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTGATTG TGGATGACTG GGCGGAGATC CGTCGGTTGC ATCGGGCGGA GGGGATGCCG 
ATCCGGGCGA TCGCTCGGCG TCTGGGGTGT TCGAAGAACA CTGTGAAGCG GGCGTTGGCC
GCGCAGGGTC CGCCGAGGTA TGAGCGGGCG ACGGTCGGGT CGGCCGTTGA TGCGTTCGAG
CCGGCCATCC GGGCGTTGTT GGCGGAGTTT CCGTCGATGC CGACGTCGGT GATCATGGAG
CGGGTTGGGT GGTCGCGGGG CCGCACGGTG TTCTTCGAGC GGGTCGCGGT GTTGCGGCCG
TTGTTCGTGC CGCCGGATCC GGCGTCGCGG ACGGAGTATG GGCCGGGGCA GTTGGCGCAG
TGCGATCTGT GGTTTCCGCC GGTGGACGTG CCGGTGGGGT TCGATCAGGT CGCCCGTCCA
CCGGTGCTAG TGATGGTGTC GGGGTTCTCA CGGGTCATCA CGGCCAGGAT GCTGCCGTCG
CGGCAGTCTG CGGATCTACT GGCTGGGCAT TGGGAGCTGC TGTTGGGGTG GGGTCGCTTG
CCCAGAGCCC TGGTCTGGGA CAACGAGGCC GCGGTCGGCC GGTGGCGCGG CGGCCGACCG
GAACTGACCG AACCGATGAA CGCCTTCCGT GGAACGTTGG GTATCAAGGT CGTGCTCTGC
GCGCCGCGCG ACCCTGAGTC CAAGGGCCTG GTCGAGCGGG CGAACGGCTA TCTGGAGACC
TCATTCCTGC CCGGCCGCAC GTTCACCTCC CCGGCTGACT TCAACGCCCA GCTGGCTGCG
TGGCTGGTCC GGGCGAACCA GCGGCAACAC CGCCGGCTCG GGTGCCGGCC CATCGACCGG
TGGGCGGCGG ACCTGGCCGC GATGATGGCG ATGCCACCGG TTGCGCCGGT GGTGGGCTGG
ACCGCGTCGC CGCTGCTGCC TCGTGATCAT TACGTCCGCG TCGATTCCAA CGACTATTCG
GTGCATCCCG GTGTGGTCGG TCGACGGGTG CAGGTGCTGG CCGATCTGGA TCAGGTCGTG
GTGACCTGCG CCGGCACGGT CGTGGCCGCG CACGAACGGT GCTGGGCGCG GCGGCAGACC
ATCACCGATG CCGACCATGC CCAGGCCGCG GCGGCGTTAC GCGCCGCCCA CCGCGAACGG
GTGCGACGGC CGGTAGAGAC CGACGTCGCG GTCCGCGAAC TCGCCGATTA CGACCGCATC
TTCGGCCTGC AGGACGACCT CGACGATCAT CCCAGCGTCG ACGTGGCCGA CGGTGAGGTC
GCCTGA
 
Protein sequence
MLIVDDWAEI RRLHRAEGMP IRAIARRLGC SKNTVKRALA AQGPPRYERA TVGSAVDAFE 
PAIRALLAEF PSMPTSVIME RVGWSRGRTV FFERVAVLRP LFVPPDPASR TEYGPGQLAQ
CDLWFPPVDV PVGFDQVARP PVLVMVSGFS RVITARMLPS RQSADLLAGH WELLLGWGRL
PRALVWDNEA AVGRWRGGRP ELTEPMNAFR GTLGIKVVLC APRDPESKGL VERANGYLET
SFLPGRTFTS PADFNAQLAA WLVRANQRQH RRLGCRPIDR WAADLAAMMA MPPVAPVVGW
TASPLLPRDH YVRVDSNDYS VHPGVVGRRV QVLADLDQVV VTCAGTVVAA HERCWARRQT
ITDADHAQAA AALRAAHRER VRRPVETDVA VRELADYDRI FGLQDDLDDH PSVDVADGEV
A