Gene Namu_4487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4487 
Symbol 
ID8450114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4990976 
End bp4992187 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content62% 
IMG OID645043531 
Producttransposase IS4 family protein 
Protein accessionYP_003203759 
Protein GI258654603 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACGATG GCGCGGTGAC CCGAGCCGAC CTGGATACAT GGAATGCCGG CCTGGAGGAC 
CTGCTCGCCC GGATGACCCC GATATTCTAT CGAACCGAAT CACGAAGACA CGCCGAACAA
TACCTGCGTG GTCTGCTGTC GCCGCTGCAA CGCAAGAACG GGTGGACGAT CGCCGAACAC
GTCGGCGAAT CCGAGCCCAA GGCCTTACAA CGGTTCCTGA ACATCTCACC GTGGGACGTC
GAGCAGTTGC TGGTCCTGAA CCGTGATTAT GTCATGGGTC ATCTGGCGTC TCCGGAGGCG
ATCCTGGTCG CCGATCCGAC CGGGTTCGCG AAGAAGGGCA AAAAATCGGT CGGTGTCCAA
CGACAGTATT CGGGGACGTT GGGGCGGATC GACAACTGCC AGATCGCGAC CTTCCTGGCT
TACGTCACTC CCGGCCGGGA CCGGGTGCTG ATCGATCGGC GGCTCTACCT GCCGGAGAAG
TCCTGGCTGG CCGACCCGGC GCGATGCGCA GAAGCGGGAG TGACTGCCGA CACCGTGTTC
CGGACCAGAC CGGAGCAGGT CATCGAGATG ATCAAGGTGG CCCGCGCGGC CGACGTGCCG
TTCGCCTGGT TCACCGCGGA CGAGGAATTC GGACAGAACC CGGGGCTGCG GGAATACCTC
GAAAACACCG GCATCAGCTA CGTGATGGCC ATCCCGAAGA ACACCACGTT CACCGACCAC
ACCGGCCGTT CACGTCCAGT CTCAGAAATT CCCCTTTCGT TGAAGCCGAC CGCCTGGCAA
CGTCGCGCCT GCGGCATCGG CGCGAAAGGA TACCGCGTCT ACGACTGGGT CCTGATCGAA
ACCGACGACC CTTCCAACCA ATTCATGATC CGCCGCTCGA CCGACAACGG TGAACTTGCC
TTCTACCACT GCCACAACCC GAACCGCACC GGATTCGGTC AACTCGTCAC CGTGGCCGGC
GCTCGATGGC CGATCGAGGA ATGTTTCGGC GCCAGCAAGA ACGAAACCGG TCTTGATCAA
TACCAGGTCC GAAAACACAA CGCATGGCAT CGCCACATCA CCCTGGCCAT GCTCGCCCAC
TCCTTCCTCA CGATCACCGC ACATCAGGCC AAAAAGGGGG ATCCGGACCA CCACCCGACG
GTTCCCCCGG ACTCGCCGAA AAGCTCCGGC GCCTCATCGC GCTCACCGTC GCCGAAATAC
GTCGACTCCT GA
 
Protein sequence
MYDGAVTRAD LDTWNAGLED LLARMTPIFY RTESRRHAEQ YLRGLLSPLQ RKNGWTIAEH 
VGESEPKALQ RFLNISPWDV EQLLVLNRDY VMGHLASPEA ILVADPTGFA KKGKKSVGVQ
RQYSGTLGRI DNCQIATFLA YVTPGRDRVL IDRRLYLPEK SWLADPARCA EAGVTADTVF
RTRPEQVIEM IKVARAADVP FAWFTADEEF GQNPGLREYL ENTGISYVMA IPKNTTFTDH
TGRSRPVSEI PLSLKPTAWQ RRACGIGAKG YRVYDWVLIE TDDPSNQFMI RRSTDNGELA
FYHCHNPNRT GFGQLVTVAG ARWPIEECFG ASKNETGLDQ YQVRKHNAWH RHITLAMLAH
SFLTITAHQA KKGDPDHHPT VPPDSPKSSG ASSRSPSPKY VDS