Gene Namu_2085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2085 
Symbol 
ID8447695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2299975 
End bp2301069 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content70% 
IMG OID645041207 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003201452 
Protein GI258652296 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000256 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0160957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTACTG TGGCAGATTC GGTCGACGCC GTCATCGGCG GTGACACCCA TGTGGACACG 
ACGAGTTTGT CTGTGGTGTC CCCGGTCGGG GCGGTGATCG AGCAGATCAC GATCGACAAC
GACGAGCAGG GTTACGCCCA GGTGGTGACC TGGATCCTGC GGGTGGTGCC TTCCGGCCGG
TTCCTGGTCG GGTTGGAGGG CACCCGCAGC TACGGTGCGG GGCTGTGCCG GGCGTTGGAA
GCGGTCGGGA TCCGGGTCGT CGAGGTCGAG CGGCCCTCCC GCGGGGAGCG GGGCCGGCGC
GGCAAGTCCG ACCCGGGCGA TGCGGTGCTA GCCGCTCGTA AGGTGCTGGC CATGGCGGTG
GAGCGGGTAC CGGCGCCGCG GACCGGCGAG GGAGTGCGGG AGGCGTTGCG GCTGCTGGTG
GTGGACCGGG AACAGATGAC CCGGCACCGG ACCCAGCTGC ACAACCAGCT GCTGGCCGAG
CTGCTCACCG GCACCGCCGA GCACCAGGCG CTGCGCCGAA AAGGTTTGAG TGGAACAGAT
CTGGAGAAGC TGGCCAAGTC TCGTTGCCGG GGTGGGCGGC CGATCGAGGA GCAGGCCCGG
CTGGTCGTGC TGCGCCGCAA GGCGAACGCG ATCATCCAGT TGGATCAGCA GATCCGGGAC
AACGGCAAGA GCCTGACGAC GATCGTTCAG GACGCTGCTC CGCAGCTGCT CAAGCAGGTC
GGCGTCGGCC CGGTCGTCGC CGCCCAGCTG ATCGTGTCCT ACAGCCACCA CGGCCGCTGC
CGGGACGAGG CGGCGTTCGC GGCGCTGGCC GGGGCCAGCC CGGTCCCGGC GTCCAGCGGC
CGGATCGTGC GGCACCGGCT CAACCGGGGC GGCGACCGCC AACTGAACCG GGCCCTGCAC
ACCGTCGCGG TCACCCGGGC CCAGTGGGAC GAGCGAACCC AGGACTACAT CCGCCGGCGC
AGCGGCAGCC TGACGGCCAA AGAGATCCGC CGGATGCTCA AGCGGTACAT CGCCCGCGAG
ATGTTCAAGA TCCTGCGCAC CATCGAGGCG TTGAACCCGA CGATGACCAA CGGTCAGGCC
GCCACGGCCG CCTGA
 
Protein sequence
MPTVADSVDA VIGGDTHVDT TSLSVVSPVG AVIEQITIDN DEQGYAQVVT WILRVVPSGR 
FLVGLEGTRS YGAGLCRALE AVGIRVVEVE RPSRGERGRR GKSDPGDAVL AARKVLAMAV
ERVPAPRTGE GVREALRLLV VDREQMTRHR TQLHNQLLAE LLTGTAEHQA LRRKGLSGTD
LEKLAKSRCR GGRPIEEQAR LVVLRRKANA IIQLDQQIRD NGKSLTTIVQ DAAPQLLKQV
GVGPVVAAQL IVSYSHHGRC RDEAAFAALA GASPVPASSG RIVRHRLNRG GDRQLNRALH
TVAVTRAQWD ERTQDYIRRR SGSLTAKEIR RMLKRYIARE MFKILRTIEA LNPTMTNGQA
ATAA