Gene Namu_5286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5286 
Symbol 
ID8450919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5903354 
End bp5904592 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content70% 
IMG OID645044319 
Producttransposase IS111A/IS1328/IS1533 
Protein accessionYP_003204541 
Protein GI258655385 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCCTGC CCGAATGCGA TTTCTACGTC GGTATCGATT GGGCCGCCCA AACGCACGCC 
GTGTGTGTGC AGGACGCGGC CGGGAAGATC ACCGCGCAGT TCACGATCGA GCACACCGCC
GACGGATTCG CGACCCTGTT GGCCCGTCTG GGTCGGTTGG CCAGCGACCC GATGCAGGTC
AGCGTCGCCC TGGAACGGCC CGACGGTCGG CTGGTCGACG CCCTGCTCGA AGCGGGCTAC
CCGGTGGTGC CGGTCAGCCC GAACGCGATC AAGACCTGGC GCGACGGGGA GGTACTGTCC
GGCGCCAAGT CCGACGCCGG CGACGCCGCC GTCATCGCCG AATATCTGCG GCTGCGGTCG
CACCGGCTGC GGCCGGCCAC CCCGTTCACC CCGGCGACCA GGGCGCTGCG TACGGTCGTA
CGCACCCGCG ATGACATCGT GGCCATGCGG ACCGCCACGG CGAACCAGCT GACCGCCCTG
CTCGATGCCC ACTGGCCCGG CGCCACCAAG GTTTTCGCCG ATATCGAGTC GCCGATCGCG
TTGGAGTTCC TGACCCGGTA CCCGACCGCC AAACACGCCG CGGGCCTGGG TGAGAAGCGC
ATGGCCGCGT TCTGCGTCAA GCACGGCTAC TCCGGTCGCC GCTCGGCCGC GGAGCTGCTG
ACCCGATTGC GGGCTGCGCC GGCCGGCACC ACCGACCCGG ACCTGGTCGA GGCCGTCCGG
GACGCCGTGC TGGCGCTGGT GGCCGTGCTG CGCACCCTGG GCGAGACCCG CAAGGACCTG
GACCGGTCGG TGACCGCCCA CCTCGGGGAG CACCCGGACG CCGCGATCTT CACGTCGCTG
CCAAGGTCGG GTCAGATCAA CGCCGCCCAG GTGCTCGCCG AGTGGGGCGA TTCCCGGCAA
GCCTACGACT CGCCCGACGC CGTCGCGGCG TTGGCCGGCC TGACCCCGGT CACCAAAGCG
TCCGGTAAAT ATCATGCCGT GCATTTCCGG TGGGCCTGCA ACAAACGATT CCGTAAAGCG
ATGACCACGT TCGCCGACAA CAGTCGCCAC CAAAGCCCGT GGGCCGCCGA GGTCTACCGC
AGAGCTATCC AACGCGGGCA CGACCACCCG CACGCCGTCC GGGTCCTGGC CCGCGCCTGG
GTGCGCGTGA TCTACCGCTG CTGGCTCGAC CGAGAGCCTT ACGACCCGGC CAGGCACGGC
AACGCGAACA AGATCAACAG CGGGCAACTT GCGGCCTGA
 
Protein sequence
MSLPECDFYV GIDWAAQTHA VCVQDAAGKI TAQFTIEHTA DGFATLLARL GRLASDPMQV 
SVALERPDGR LVDALLEAGY PVVPVSPNAI KTWRDGEVLS GAKSDAGDAA VIAEYLRLRS
HRLRPATPFT PATRALRTVV RTRDDIVAMR TATANQLTAL LDAHWPGATK VFADIESPIA
LEFLTRYPTA KHAAGLGEKR MAAFCVKHGY SGRRSAAELL TRLRAAPAGT TDPDLVEAVR
DAVLALVAVL RTLGETRKDL DRSVTAHLGE HPDAAIFTSL PRSGQINAAQ VLAEWGDSRQ
AYDSPDAVAA LAGLTPVTKA SGKYHAVHFR WACNKRFRKA MTTFADNSRH QSPWAAEVYR
RAIQRGHDHP HAVRVLARAW VRVIYRCWLD REPYDPARHG NANKINSGQL AA