Gene Namu_4069 details


Gene Information

Locus tag: Namu_4069
Symbol:
ID: 8449689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4484016
End bp: 4485344
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 69%
IMG OID: 645043113
Product: transposase IS204/IS1001/IS1096/IS1165 family protein
Protein accession: YP_003203348
Protein GI: 258654192
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
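
The start, end, gene length and protein length values above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4484016
END_BP = 4485344


def gene_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1329 442
```

Both results agree with the reported gene length (1329 bp) and protein length (442 aa).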


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.414136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCAGA GTAGTAGCTC GCTGCTGTTG GACATTGACG GGTTGGTCGT CGACCGGGTC 
GTCCGCAACG ACGCCGGCCG ACGGGTCGTG CACTGCTCGA CCGACCCCCA ACTGGCCGGC
TGGTGCCCGG AGTGCGGTGA GCAGTCGAAG TCTCCGAAGG CGTGGGTGAC GACCCGCCCG
CGGGACGTCC GGCTCGGCGA GGACAAGCCG ATCCTGTTGT GGCGCAAACG GAAATGGCGC
TGCCAGGTCG ACAGCTGCGA GCGGAAGGTG TTCACCGAAT GCCTACCCGA GCAGATCCCT
GCCCGGGCCC GGATCACCAC CCGCGCCCGC CGGCTGGCGG CGGAAGCGAT CGGCGACCAC
ACCCGACCGG TGTCCGGCGT CGCGGCCGAG TTCGGCATGG ACTGGCGCAT CGCGCACGAC
GCGTTCGTCG CCCACTCTGC CGCGGTGCTC CCCGACGCGC CGCCGCCGGT CACCGTGCTG
GGCGTCGACG AGACCCGCCG CGGCAAGGCC CACTACGAGA CCGACCCGAC CACCGGGGAG
AAGACCTGGG TGGACCGGTT CGACACCGGC CTGGTCGATC TGAGCGGCAA CGGTGGCCTG
TTCGCACAGG TCAACGGCCG CACCAGCAAG GTCCTCATCG AGTGGCTGCA GGCGCAGGAC
CCGGACTGGC TCGCCACCAT CACCCACATC TCGATGGACA CGTCCGCGAC GTACGCCCGC
GCCGCCCGCC TCGCCCTGCC GAACGCCGTC GTGGTCGTGG ACCGGTTCCA CCTGGTCGCC
CTGGCCAACA AGGCGGTCAC CGACTACCGG CGGGAGTTGG CCTGGGCGCT TCGTGGCCGG
CGGGGCCGCA AGTGCGACCC GGAATGGGCG CAACGGAACC GGCTGCTGCG CGCCGTGGAG
ACTCTCACTC CGGACGAGCT GGCCAAGGTG CGGGAAGCGA TGCGCCGGGC CGACCCCTCC
GGCGGCCTCG AGAAATGCTG GCAGGGCAAG GAACTGCTCC GCAAGCTGCT CAAGCTCGCC
GGCACCAACC CCGACCGCGG ACAGATCTTC AACGCGCTGA CCGCGTTCTA CCTGCACTGC
GCCGACTCCG AGATCTCCCA GCTGCGCAGG CTCGCGTGGA CGGTGCATGC CTGGCAGAAC
TCGATCATCG CCGGCCTGCA CACCGGCATC AGCAACGGCC GCACCGAGGG CTACAACCGG
ATCGTCAAAC ACATCGGCCG GATCGCGTTC GGCTTCCGCA ACCAGGACAA CCAGAAGCGG
CGGATACGCT ACGCCTGCAC CCGGAAATCC CGGGCGTCAA CCAGCAGCGC GAAGCCCTGC
CAACTCTGA
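
The GC content reported for this gene is the share of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows. It assumes the sequence is pasted as shown, in space-separated blocks of ten bases, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the
    space-separated blocks can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# First block of the gene sequence: 5 of its 10 bases are G or C.
print(gc_fraction("ATGGACCAGA"))  # 0.5
```

Passing the full gene sequence to the same function gives the gene-wide value to compare against the reported percentage.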
 
Protein sequence
MDQSSSSLLL DIDGLVVDRV VRNDAGRRVV HCSTDPQLAG WCPECGEQSK SPKAWVTTRP 
RDVRLGEDKP ILLWRKRKWR CQVDSCERKV FTECLPEQIP ARARITTRAR RLAAEAIGDH
TRPVSGVAAE FGMDWRIAHD AFVAHSAAVL PDAPPPVTVL GVDETRRGKA HYETDPTTGE
KTWVDRFDTG LVDLSGNGGL FAQVNGRTSK VLIEWLQAQD PDWLATITHI SMDTSATYAR
AARLALPNAV VVVDRFHLVA LANKAVTDYR RELAWALRGR RGRKCDPEWA QRNRLLRAVE
TLTPDELAKV REAMRRADPS GGLEKCWQGK ELLRKLLKLA GTNPDRGQIF NALTAFYLHC
ADSEISQLRR LAWTVHAWQN SIIAGLHTGI SNGRTEGYNR IVKHIGRIAF GFRNQDNQKR
RIRYACTRKS RASTSSAKPC QL
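
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits). It then translates the first and last few codons of the gene. The helper name `translate` is illustrative, and the sketch assumes the input is an in-frame forward-strand sequence.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks and the final nine bases of the gene sequence.
print(translate("ATGGACCAGA GTAGTAGCTC GCTGCTGTTG"))  # MDQSSSSLLL
print(translate("CAACTCTGA"))  # QL*
```

The two outputs match the start (`MDQSSSSLLL`) and end (`QL`) of the protein sequence, with the trailing `*` marking the stop codon that is not counted in the protein length.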