Gene Namu_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0121 
Symbol 
ID8445701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp134624 
End bp136021 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content70% 
IMG OID645039269 
Producttransposase IS4 family protein 
Protein accessionYP_003199544 
Protein GI258650388 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones88 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAGT CTACCGGGCT GTACCCGTCG CTCCGAGTCG ATGCCACCGG CAAGCGGGTG 
GTGTCCCACG GCGGGTCGGT GCTGCTGGCC TTGGCCGCGG ACAGGGTCGG TTTGGGTCGT
GGGTTGTCGG CGGCGTTGAG GCCGTGGCGC AAGCCGATGG CGGTGCACGA CCCGGGCAAG
ATCCTGCTCG ACCTGGCGAT CTCGCTGGCG ATCGGCGGTG ACTGCCTGGC CGACATCGCC
CAACTGCGGG CCGAGCCCGC CGTGTTCGGT CATGTGGCGT CCGACCCGAC TGTGTCCCGG
CTGATCGACA CCCTCGCCTC GGACGCGACG GCAGCGCTGA AGGCGATCGA CACCGCGCGG
GCGGCAGCCC GCGCCCGGGC GTGGAAGCTG GCCGGACCCG CCGCACCGAA CCATGACCGG
TCGGCGAAGG CGCCGCTGGT CGTCGACGTG GACGCCACCC TGGTCACCGC GCACTCCGAA
AAGCAGCTGG CAGCAGCAAC ATTCAAGAAG GGCTTCGGGT TCCACCCGAT CGGCGCGTGG
GCCGACCACG GCCCGGACGG CACCGGTGAA CCCCTGGCGA TGCTGCTGAG GCCGGGCAAC
GCCGGCTCCA ACACCGCAGC CGACCACATC AGCGTGGTCA AGGCCGCGCT CGCGCAACTG
CCCTGCACCA CGGCGGACCG ACGGCCCGGC CGCGGTGTGC TGGTCCGCAC CGACGGGGCC
GGCGGAACCC ACGAGTTCGT GGACTGGATG GCCCGGCAAC GGGTCCAGTA CTCGGTCGGG
TTCACCCTGA CCACCGACAT CACCGCCAAG GTCGACGCCC TGCCGGAGGC GGCGTGGACA
CCCGCGTACA ACGCCGACCA GGAGCCCCGG GACGGGGCCT GGGTGGCCGA ACTGACCGGG
GTCCTCAAGC TCAAGGGCTG GCCCAAGGAC ATGCGGGTCA TCGTCCGCGC CGAACGACCC
CATCCCGGTG CCCAGCTGAA GTTCACCGAC TCGAACGGCA ACCGGCTCAC CGCGTTCGCC
ACGAACACCA AAGGCGGACA GCTCGCGGAT CTGGAACTGC GGCATCGGCG CCGCGCCCGC
TGCGAGGACC GGATCCGCAA CTCGAAGGAC ACCGGCCTGA ACAACCTGCC CCTCAACGAC
TTTGCCCAGA ATCAAGTGTG GATCGCGGTC GTGCAACTGG CCACCGAACT GACCGCATGG
ATGCAGATGC TCGCCTTCAC CGGCACCCCG GCGCGGACCT GGGAGCCCAA GAAGCTGCGG
CACCGACTGT TCAGCGTCGC CGCCCGGATC GGCCGCAAAG CCCGCCGTAC CTGGCTCCGC
CTGTCCGCGC ACGCACCCCA CCGCGACCTC CTCCTGCACG GCCTGGCCCG GCTGCGGAAC
CTGCCGCAAC TGACCTGA
 
Protein sequence
MRKSTGLYPS LRVDATGKRV VSHGGSVLLA LAADRVGLGR GLSAALRPWR KPMAVHDPGK 
ILLDLAISLA IGGDCLADIA QLRAEPAVFG HVASDPTVSR LIDTLASDAT AALKAIDTAR
AAARARAWKL AGPAAPNHDR SAKAPLVVDV DATLVTAHSE KQLAAATFKK GFGFHPIGAW
ADHGPDGTGE PLAMLLRPGN AGSNTAADHI SVVKAALAQL PCTTADRRPG RGVLVRTDGA
GGTHEFVDWM ARQRVQYSVG FTLTTDITAK VDALPEAAWT PAYNADQEPR DGAWVAELTG
VLKLKGWPKD MRVIVRAERP HPGAQLKFTD SNGNRLTAFA TNTKGGQLAD LELRHRRRAR
CEDRIRNSKD TGLNNLPLND FAQNQVWIAV VQLATELTAW MQMLAFTGTP ARTWEPKKLR
HRLFSVAARI GRKARRTWLR LSAHAPHRDL LLHGLARLRN LPQLT