Gene Namu_2079 details


Gene Information

Locus tag: Namu_2079
Symbol:
ID: 8447689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2291364
End bp: 2292968
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 69%
IMG OID: 645041201
Product: IstA2
Protein accession: YP_003201446
Protein GI: 258652290
COG category: [L] Replication, recombination and repair
COG ID: [COG4584] Transposase and inactivated derivatives
TIGRFAM ID:
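
The start and end coordinates, gene length, and protein length in this record are mutually consistent, and the relationship can be sketched in a few lines of Python. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2291364, 2292968)
print(length, protein_length_aa(length))  # prints "1605 534"
```

Both values match the Gene Length and Protein Length fields listed above.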


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0246359
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00457727
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGGTCGA GGGTGGGGTT GTTCGCTGTC ATCCGGCGCG ACGCCCGGGT CGAGGGATTG 
TCGATCCGCG AGCTCGCTGA TCGGCATCAC GTGCACCGCA GAACCGTTCG GCAGGCGATG
GCCAGTGCGT TACCGCCGCC GCGGAAGACA CCGGTGCGGG TCTCCCGGAA GCTCGAACCG
TTCAAGGTCA CGATCGACGA CTGGCTGCGG GCGGACCTGG ACGCGCCGAG GAAGCAACGC
CACACCGCGA AGCGGGTGCT GGACCGGCTC CTCGACGAAC ACGGCGCCGC CGATGTGTCG
TACTCGACGG TGCGGGATTA CGTCGCCCGG CGACGCCCGG AGATCGCCGC CGCGGCCGGC
CGGACCTTGT CGCAGGGTTT CGTCCCGCAG ACCCATGAGC CGGGTGGTGA GGCCGAGGTC
GACTTTGCCG ATCTGTGGGT CGTGCTGCGC GGGGTGAAGA CCAAGACGTT CCTGTTCACC
CTGCGCCTGT CGTATTCCGG GAAGGCGGTG CACCGGGCGT TCGCCACCCA GGGCCAGGAG
GCGTTCCTGG AAGGTCATGT GCACGCGTTC ACCGAACTGG GCGGCACCCC GATCGACAAG
ATCCGCTACG ACAACCTCAA AGCCGCGGTG TCCCGGGTGC TGTTCGGTCG TGGCCGGGAG
GAATCCGGCC GGTGGGTGGC GTTCCGATCC CATTTCGGGT TCGATGCGTT CTACTGCCAC
CCCGGGCAGG AAGGTGCCCA CGAGAAGGGC GGCGTCGAAG GCGAGGGCGG CCGGTTCCGC
CGCAACCACT GCGTCCCGAT GCCGGTCGTG GACTCCATCG AGCAGCTCAA TGAGCTGCTC
GTCGCGGCGG ACGCGAAGGA TAACTACCGG CGGATCGCGA GCCGCACCAA CACCGTCGCC
CAGGACTGGG CGTTCGAACG GGACACGCTG CGGCCGTTGC CGTCCGAGGT GTTCCCGACC
TGGCTGACTC TGACCCCCAG GGTTGACCGG TATGCCCGGG TGACCGTCCG GCAACGGCAC
TACTCGGTGC CGGCCCGGTT CATCGGCCGC CGGGTCCGGG TGCAGCTCGG CGCTTCATCG
GTGACCGCGT TCGACGGTCG CACCGTCATC GCCACCCATG AACGGGTCAT GCTCAAGGGC
GGCCAGTCCC TGGTCCTGGA CCACTACCTC GAGGTGCTGC AACGCAAACC CGGCGCACTG
CCCAACGCGA CCGCGTTGGT GCAGGCCCGC GCGTCCGGAA TGTTCACCGC GGCGCATGAG
GCGTTCTGGG CCGCCGCCCG CAAGGCTCAT GGTGACTCCG GCGGCACCCG AGCGTTGATC
GAGGTGCTGC TGCTGCACCG GCACCTGGCC GCCTCCGATG TGATCGCGGG GATCACCGCC
GCTCTCACGG TGGGCTCGGT CAGCCCGGAC GTCGTCGCTG TCCAGGCCCG CAAAACCGCG
CACCAGTGCA GCGCAGACGC AGTGATCGCA TCACCGAACA CCACACCGGC CGGGGATCGG
GTGGTCAGCC TGACCGAGCG GCGCCTGGCG GAGCTGCCCG CCGACTCGCG CCCGTTGCCG
TCGGTGTCGC AGTACGACGA GCTGCTGACC AGGGAATCGT CATGA
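
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the input contains only the letters A, C, G, and T plus whitespace.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Example on the first two blocks of the gene sequence only.
fragment = clean_sequence("ATGGGGTCGA GGGTGGGGTT")
print(len(fragment), gc_fraction(fragment))
```

Applying `gc_fraction` to the full cleaned sequence and rounding to a whole percentage should give a figure comparable to the GC content reported in the record.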
 
Protein sequence
MGSRVGLFAV IRRDARVEGL SIRELADRHH VHRRTVRQAM ASALPPPRKT PVRVSRKLEP 
FKVTIDDWLR ADLDAPRKQR HTAKRVLDRL LDEHGAADVS YSTVRDYVAR RRPEIAAAAG
RTLSQGFVPQ THEPGGEAEV DFADLWVVLR GVKTKTFLFT LRLSYSGKAV HRAFATQGQE
AFLEGHVHAF TELGGTPIDK IRYDNLKAAV SRVLFGRGRE ESGRWVAFRS HFGFDAFYCH
PGQEGAHEKG GVEGEGGRFR RNHCVPMPVV DSIEQLNELL VAADAKDNYR RIASRTNTVA
QDWAFERDTL RPLPSEVFPT WLTLTPRVDR YARVTVRQRH YSVPARFIGR RVRVQLGASS
VTAFDGRTVI ATHERVMLKG GQSLVLDHYL EVLQRKPGAL PNATALVQAR ASGMFTAAHE
AFWAAARKAH GDSGGTRALI EVLLLHRHLA ASDVIAGITA ALTVGSVSPD VVAVQARKTA
HQCSADAVIA SPNTTPAGDR VVSLTERRLA ELPADSRPLP SVSQYDELLT RESS
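
The protein sequence corresponds to a translation of the gene sequence under translation table 11 (the bacterial, archaeal, and plant plastid code). The following is a minimal sketch of such a translation, not the pipeline used to produce this record. It assumes the codon assignments and start codon set of NCBI table 11, an input made up only of A, C, G, and T, and a reading frame that starts at the first base; the name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for table 11; an initiating codon is read as methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("ATGGGGTCGA GGGTGGGGTT GTTCGCTGTC"))  # prints "MGSRVGLFAV"
```

The stop codon is not included in the output, which is why the protein is one residue shorter than the number of codons in the gene.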