Gene Namu_2061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2061 
Symbol 
ID8447671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2274988 
End bp2276385 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content71% 
IMG OID645041185 
Producttransposase IS4 family protein 
Protein accessionYP_003201430 
Protein GI258652274 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000689755 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00689759 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGAAGT CTACCGGGCT GTACCCGTCG CTCCGAGTCG ATGCCACCGG CAAGCGGGTG 
GTGTCCCACG GCGGGTCGGT GCTGCTGGCC TTGGCCGCGG ACAGGGTCGG TTTGGGTCGT
GGGTTGTCGG CGGCGTTGAG GCCGTGGCGC AAGCCGATGG CGGTGCACGA CCCGGGCAAG
ATCCTGCTCG ACCTGGCGAT CTCGCTGGCG ATCGGCGGTG ACTGCCTGGC CGACATCGCC
CAACTGCGGG CCGAGCCCGC CGTGTTCGGT CATGTGGCGT CCGACCCGAC TGTGTCCCGG
CTGATCGACA CCCTCGCCCC GGACGCGACG GCAGCGCTGA AGGCGATCGA CACCGCGCGG
GCGGCAGCCC GCGCCCGGGC GTGGAAGCTG GCCGGACCCG CCGCACCGAA CCATGACCGG
TCGGCGAAGG CGCCGCTGGT CGTCGACGTG GACGCCACCC TGGTCACCGC GCACTCCGAA
AAGCAGCTGG CAGCAGCAAC ATTCAAGAAG GGCTTCGGGT TCCACCCGAT CGGCGCGTGG
GCCGACCACG GCCCGGACGG CACCGGTGAA CCCCTGGCGA TGCTGCTGAG GCCGGGCAAC
GCCGGCTCCA ACACCGCAGC CGACCACATC AGCGTGGTCA AGGCCGCGCT CGCGCAACTG
CCCTGCACCA CGGCGGACCG ACGGCCCGGC CGCGGTGTGC TGGTCCGCAC CGACGGGGCC
GGCGGAACCC ACGAGTTCGT GGACTGGATG GCCCGGCAAC GGGTCCAGTA CTCGGTCGGG
TTCACCCTGA CCACCGACAT CACCGCCAAG GTCGACGCCC TGCCGGAGGC GGCGTGGACA
CCCGCGTACA ACGCCGACCA GGAGCCCCGG GACGGGGCCT GGGTGGCCGA ACTGACCGGG
GTCCTCAAGC TCAAGGGCTG GCCCAAGGAC ATGCGGGTCA TCGTCCGCGC CGAACGACCC
CATCCCGGCG CTCAGCTCAA GTTCACCGAC TCGAACGGCA ACCGGCTCAC CGCGTTCGCC
ACGAACACCA AAGGCGGACA GCTCGCGGAT CTGGAACTGC GGCATCGGCG CCGCGCCCGC
TGCGAGGACC GGATCCGCAA CGCCAAGGAC ACCGGCCTGA ACAACCTGCC CCTCAACGAC
TTTGCCCAGA ATCAAGTGTG GATCGCGGTC GTGCAACTGG CCACCGAACT GACCGCATGG
ATGCAGATGC TCGCCTTCAC CGGCACCCCG GCGCGGACCT GGGAGCCCAA GAAGCTGCGG
CACCGACTGT TCAGCGTCGC CGCCCGGATC GGCCGCAAAG CCCGCCGTAC CTGGCTCCGC
CTGTCCGCGC ACGCACCCCA CCGCGACCTC CTCCTGCACG GCCTGGCCCG GCTGCGGAAC
CTGCCGCAAC TGACCTGA
 
Protein sequence
MRKSTGLYPS LRVDATGKRV VSHGGSVLLA LAADRVGLGR GLSAALRPWR KPMAVHDPGK 
ILLDLAISLA IGGDCLADIA QLRAEPAVFG HVASDPTVSR LIDTLAPDAT AALKAIDTAR
AAARARAWKL AGPAAPNHDR SAKAPLVVDV DATLVTAHSE KQLAAATFKK GFGFHPIGAW
ADHGPDGTGE PLAMLLRPGN AGSNTAADHI SVVKAALAQL PCTTADRRPG RGVLVRTDGA
GGTHEFVDWM ARQRVQYSVG FTLTTDITAK VDALPEAAWT PAYNADQEPR DGAWVAELTG
VLKLKGWPKD MRVIVRAERP HPGAQLKFTD SNGNRLTAFA TNTKGGQLAD LELRHRRRAR
CEDRIRNAKD TGLNNLPLND FAQNQVWIAV VQLATELTAW MQMLAFTGTP ARTWEPKKLR
HRLFSVAARI GRKARRTWLR LSAHAPHRDL LLHGLARLRN LPQLT