Gene Information |
Locus tag | Namu_2536 |
Symbol | |
ID | 8448147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2788637 |
End bp | 2789737 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645041644 |
Product | putative transposase |
Protein accession | YP_003201888 |
Protein GI | 258652732 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 5.86897e-9 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000186966 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGCATC CCGGTCGCCC GAAGGCTGCC CTCGAGCTCG CTGATGATGA GCGAGAAGCG TTGACGAGGT GGGCCCGTCG ACCCACCTCG TCGCAGCAGC TGGCGTTGCG TTCTCGCATC GTGTTGGTCT GCGCGGACGG GCACACGAAC ACCGCGGTCG CCGAGCAGCT GGGCATCAAC AAGGTGACCG TGGGCAAGTG GCGGGCCCGG TTCGTGCGGC ACCGGCTGGA GGGGCTGACG GACGAGCCGC GTCCGGGCGC GCCGCGCACG GTGATCGACG ACGCGGTGGA GCGGGTGATC GTTAAAACCC TGGAGTCCAA GCCGGTCGAC GCGACCCACT GGTCGACGCG GTCGATGGCC GCGGCCACCG GGATGAGCCA GACGGCGATC TCGCGGATCT GGCGGGCGTT CGGGCTCAAA CCGCACCGGC AGGAGAGCTT CAAGCTGTCC ACGGACCCTC AATTCGTGGA GAAGGTCCGC GACGTCGTCG GCCTGTACCT GAACCCGCCG GAGCAGGCGC TGGTGTTCTG CGTGGATGAG AAGACCCAAG TGCAGGCCCT GGACCGATCC CAGCCGGTCA TCCCGATGAT GCCCGGCACC CCGGAACGCC TCACCCACGA CTACATCCGG GCCGGCACGC TGGACCTGTT CGCCGCCCTC GAGGTTGCCG GCCCCACCGC CGGGCGGGTG ATCACTCAAC TGCACCCGCA GCATCGCGCG ATCGAGTTCC GTAAGTTCCT GGTCGCGATC GATAAGGCGG TCCCGGCCGG CTACGACGTC CACCTGGTCG TGGACAACCT GTCCACCCAC AAGACCGCGG CGATCCACAA GTGGCTCCTG GCGCACCCCC GATTCCACCT GCACTTCACC CCGACCGGGT CGTCCTGGCT CAACCTGGTC GAACGCTGGT TCGCCGAAAT CACCATGAAA CTGATCCGCC GCGGAGTCCA CTGCAGCGTC AAAGACCTCG CCAAGGACAT CACCAACTGG GCCGAGAACT GGAACTCCGA CCCCAAGCCC TACGTCTGGA CCAAGACCGC AGACGACATC CTCGACACCC TCGCCGCATA TTGTCAGCGA ATCAATGACT CAGCACACTA G
|
Protein sequence | MPHPGRPKAA LELADDEREA LTRWARRPTS SQQLALRSRI VLVCADGHTN TAVAEQLGIN KVTVGKWRAR FVRHRLEGLT DEPRPGAPRT VIDDAVERVI VKTLESKPVD ATHWSTRSMA AATGMSQTAI SRIWRAFGLK PHRQESFKLS TDPQFVEKVR DVVGLYLNPP EQALVFCVDE KTQVQALDRS QPVIPMMPGT PERLTHDYIR AGTLDLFAAL EVAGPTAGRV ITQLHPQHRA IEFRKFLVAI DKAVPAGYDV HLVVDNLSTH KTAAIHKWLL AHPRFHLHFT PTGSSWLNLV ERWFAEITMK LIRRGVHCSV KDLAKDITNW AENWNSDPKP YVWTKTADDI LDTLAAYCQR INDSAH
|
| |
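The coordinates, gene length, and protein length listed in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the helper names `gene_length`, `expected_protein_length`, and `ungroup` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def ungroup(seq: str) -> str:
    """Remove the spaces that split a sequence into blocks of 10 characters."""
    return "".join(seq.split()).upper()


# Values taken from the Gene Information table above.
length = gene_length(2788637, 2789737)
print(length)                           # 1101
print(expected_protein_length(length))  # 366
```

Both results match the Gene Length (1101 bp) and Protein Length (366 aa) fields. `ungroup` can be applied to the Gene sequence or Protein sequence field before passing it to other tools, since both are printed here in space-separated blocks of ten.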