Gene Namu_2839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2839 
Symbol 
ID8448452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3105881 
End bp3107119 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content70% 
IMG OID645041931 
Producttransposase IS111A/IS1328/IS1533 
Protein accessionYP_003202173 
Protein GI258653017 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00132212 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0366179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCCTGC CTGAATGCGA TTTCTACGTC GGTATCGACT GGGCCGCCGA AACGCACGCC 
GTCTGCGTGC AGAACGCCAC CGGAAGGATC ACCGCCGAGT TCACGATCGA GCACACCGCC
GACGGTTTCG CGACCCTGCT CGCGCGTCTG GGCCGGTTGG CCGACGACCC GATGCAGGTC
ACGGTCGCGA TCGAACGGCC CGACGGCCGG CTTGTCGACG CGCTGCTCGA GGCCGGCTAC
CCGGTGGTGC CGGTCAGCCC GAACGCGATC AAGACCTGGC GGGACGGTGA GGTGCTCTCC
GGCGCCAAGT CCGACGCCGG CGACGCCGCC GTCATCGCCG AGTACCTGCG GTTGCGGTCG
CACCGGCTGC GGCCGGCCAC CCCGTTCACC GCGCAGACCA AGGCGGTTCG GACGGTCGTG
CGCACCCGAG ACGACATCGT GGCCATGCGG ACCGCCGCGA CGAACCAGCT GACCGCCCTG
CTCGATGCCC ACTGGCCCGG CGCCACCAAG GTTTTCGCCG ATATCGAGTC GCCGATCGCG
TTGGAGTTCC TGACCCGGTA CCCGACCGCC AAACACGCCG CCGGCCTCGG CGAGAAGCGC
ATGGCCGCGT TCTGCGTCAA GCACGGCTAC TCCGGTCGCC GCCCGGCCGC GGAGCTGCTG
ACCCGGCTGC GGGCTGCGCC GGCCGGCACC ACCGACCCCG ACCTGGTCGA GGCCGTCCGG
GACGCCGTGC TCGCGCTGGT GGCCGTGCTG CGCACCCTCG GCGCGGTCCG CAAGGACCTG
GACCGGTCGG TGACCGCCCA CCTCGGGGAG CACCCGGACG CCGCGATCTT CACGTCGCTG
CCAAGGTCGG GTCAGATCAA CGCCGCCCAG GTGCTCGCCG AGTGGGGCGA CTCCCGCCAA
GCCTACGACT CGCCCGACGC CGTCGCGGCG TTGGCCGGCC TGACCCCGGT CACCAAAGCG
TCCGGTAAAT ATCATGCCGT GCATTTCCGC TGGGCCTGCA ACAAAAGGTT CCGCAAAGCG
ATGACCACGT TCGCCGACAA CAGTCGCCAC CAAAGCCCGT GGGCCGCCGA GGTCTACCGC
AGAGCTATCC AACGCGGGCA CGACCACCCG CACGCCGTCC GGGTCCTGGC CCGCGCCTGG
GTGCGCGTGA TCTACCGCTG CTGGCTCGAC CGGCAGCCTT ACGACCCGGC CAGGCACGGC
AACGCGAACA AGATCAACAG CGGGCAACTT GCGGCCTGA
 
Protein sequence
MSLPECDFYV GIDWAAETHA VCVQNATGRI TAEFTIEHTA DGFATLLARL GRLADDPMQV 
TVAIERPDGR LVDALLEAGY PVVPVSPNAI KTWRDGEVLS GAKSDAGDAA VIAEYLRLRS
HRLRPATPFT AQTKAVRTVV RTRDDIVAMR TAATNQLTAL LDAHWPGATK VFADIESPIA
LEFLTRYPTA KHAAGLGEKR MAAFCVKHGY SGRRPAAELL TRLRAAPAGT TDPDLVEAVR
DAVLALVAVL RTLGAVRKDL DRSVTAHLGE HPDAAIFTSL PRSGQINAAQ VLAEWGDSRQ
AYDSPDAVAA LAGLTPVTKA SGKYHAVHFR WACNKRFRKA MTTFADNSRH QSPWAAEVYR
RAIQRGHDHP HAVRVLARAW VRVIYRCWLD RQPYDPARHG NANKINSGQL AA