Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4487 |
Symbol | |
ID | 8450114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4990976 |
End bp | 4992187 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645043531 |
Product | transposase IS4 family protein |
Protein accession | YP_003203759 |
Protein GI | 258654603 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5659] FOG: Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTACGATG GCGCGGTGAC CCGAGCCGAC CTGGATACAT GGAATGCCGG CCTGGAGGAC CTGCTCGCCC GGATGACCCC GATATTCTAT CGAACCGAAT CACGAAGACA CGCCGAACAA TACCTGCGTG GTCTGCTGTC GCCGCTGCAA CGCAAGAACG GGTGGACGAT CGCCGAACAC GTCGGCGAAT CCGAGCCCAA GGCCTTACAA CGGTTCCTGA ACATCTCACC GTGGGACGTC GAGCAGTTGC TGGTCCTGAA CCGTGATTAT GTCATGGGTC ATCTGGCGTC TCCGGAGGCG ATCCTGGTCG CCGATCCGAC CGGGTTCGCG AAGAAGGGCA AAAAATCGGT CGGTGTCCAA CGACAGTATT CGGGGACGTT GGGGCGGATC GACAACTGCC AGATCGCGAC CTTCCTGGCT TACGTCACTC CCGGCCGGGA CCGGGTGCTG ATCGATCGGC GGCTCTACCT GCCGGAGAAG TCCTGGCTGG CCGACCCGGC GCGATGCGCA GAAGCGGGAG TGACTGCCGA CACCGTGTTC CGGACCAGAC CGGAGCAGGT CATCGAGATG ATCAAGGTGG CCCGCGCGGC CGACGTGCCG TTCGCCTGGT TCACCGCGGA CGAGGAATTC GGACAGAACC CGGGGCTGCG GGAATACCTC GAAAACACCG GCATCAGCTA CGTGATGGCC ATCCCGAAGA ACACCACGTT CACCGACCAC ACCGGCCGTT CACGTCCAGT CTCAGAAATT CCCCTTTCGT TGAAGCCGAC CGCCTGGCAA CGTCGCGCCT GCGGCATCGG CGCGAAAGGA TACCGCGTCT ACGACTGGGT CCTGATCGAA ACCGACGACC CTTCCAACCA ATTCATGATC CGCCGCTCGA CCGACAACGG TGAACTTGCC TTCTACCACT GCCACAACCC GAACCGCACC GGATTCGGTC AACTCGTCAC CGTGGCCGGC GCTCGATGGC CGATCGAGGA ATGTTTCGGC GCCAGCAAGA ACGAAACCGG TCTTGATCAA TACCAGGTCC GAAAACACAA CGCATGGCAT CGCCACATCA CCCTGGCCAT GCTCGCCCAC TCCTTCCTCA CGATCACCGC ACATCAGGCC AAAAAGGGGG ATCCGGACCA CCACCCGACG GTTCCCCCGG ACTCGCCGAA AAGCTCCGGC GCCTCATCGC GCTCACCGTC GCCGAAATAC GTCGACTCCT GA
|
Protein sequence | MYDGAVTRAD LDTWNAGLED LLARMTPIFY RTESRRHAEQ YLRGLLSPLQ RKNGWTIAEH VGESEPKALQ RFLNISPWDV EQLLVLNRDY VMGHLASPEA ILVADPTGFA KKGKKSVGVQ RQYSGTLGRI DNCQIATFLA YVTPGRDRVL IDRRLYLPEK SWLADPARCA EAGVTADTVF RTRPEQVIEM IKVARAADVP FAWFTADEEF GQNPGLREYL ENTGISYVMA IPKNTTFTDH TGRSRPVSEI PLSLKPTAWQ RRACGIGAKG YRVYDWVLIE TDDPSNQFMI RRSTDNGELA FYHCHNPNRT GFGQLVTVAG ARWPIEECFG ASKNETGLDQ YQVRKHNAWH RHITLAMLAH SFLTITAHQA KKGDPDHHPT VPPDSPKSSG ASSRSPSPKY VDS
|
| |