Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4047 |
Symbol | |
ID | 8449667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4462240 |
End bp | 4463790 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645043093 |
Product | HNH endonuclease |
Protein accession | YP_003203328 |
Protein GI | 258654172 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0910573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.468754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGTCG ACGACCACTG GGGCGGCCGC CCGGTGCGGG TCTGGTCCGG GGCCCGGCCC CCGGCCCGCT GGGGCCTGCC GGAGTCGTTG ACCCACGGGG TCGTGTTGCC CTTCCGTGCC AGCGCCGCTC GCGCGGCCGT CCGTGTGCGG TCCTGGCGGG AGACCCTGGC CACCACGGGC CCGGGGGCCG ACCTGGCCCG GGTGCTGGAG ACCCGGCCCG ACGGCCCCGT CGACCGAGCC GACGGCTCGA CCGACTGCTC GGCTGAGGCC ACCACGACGT CAGTGCCGCT GTGGGCGAAC ACGTTGATCG ATTCGCTCCA GGCCCGCCAA TCCCTGATGG CGCACCTGCA GGCCCGGCAG CAGGCCGATC TGGCCGAGCT GTCCGGCTGC TACCCGGGCC TGCACGAGTT CCTGGCCACC GAGATCGCCC TCGCCCTGGG CATCGCCGAG GGCACCGCGA CCCGGCAACT GGCCGAGGCC GTCGACCTGA CCACGCGGCT GCCCGAGACC TTCGCCGCGC TGGACGAGGG CCGGATCACC CCGGTCAAGG CATCCACCAT CCGGTCCCAC ACCCAGGATC TGGACCTGGA TCAGTCCGCC CTGGTCGAGG CCGACGTCCT GCCGGCGGCG CCCCGGGGGA CCGTGCCCGA ACTGCGACAT GCCGTGCAGC GCGCGGTCAT CCGGCACGAC CCGCACGGCG CCGACGACCG GCATCAGGCC CAGCGCGAGC GCCGCCGGGT CAGTGTCAAG GCCCAGCCCG ACGGGATGGG CAGCCTGTGG CTGCTGTCCA CCGCGCAGGA CGTCGCCACC ATCCAAGCCT GCCTGGTCGC CGTGGGTGAC GCGGCCGCCT CCCCGGACGA CGGTCGCACC GCCGACGCCC GCCGGGTCGA TGCCATGGTC GACCTGTGCG CCGAGGTCCT TGACTCGGGG CAATGGCGGG GCACGGCCCT GCCCACCCGG CAGCGCCGCC GCCCGCACGT CCAGGTGACC GTCCCGATCA CCGCGCTGCT CGACCCCGCC ACGACGGCCG GCCAGAGCGC CGAGTTGCAC GGATACGGCC CCATCACTGC GGCCCAGGCC GCCCAGATCA CCGCCGACGC CACCCTGCGC CGCCTGGTCT GCGACCCGCT GACCGGAGCG CTGCTGGATT ACGGCCGGAC CACCTACCAC CCACCGGCCG CGCTGGCCGA CCACGTGCTG GTCCGCGACC AGACCTGCCG GCTGCCCGGC TGCCGACAGC CCGCGCAGCG CTGCGAGCTC GACCACGTCG AACCGTTCCG CCCGGGCCAC GACACCGGCG GAACCACCAG CGCGACCAAT CTGTGCCTGG TCTGCAAACA CCACCACCGG GCCAAGGACG GCGGCGAGTT CCTTCTCCGG CGCACCGCCG ACGGCTACGA CTGGACCAGC CCGCTGGGCC GCCGCTACCG GCAACCGCCG ACCCGGCTGT GGGAACCGCC GCCGGAACAC CCACCCGAGC GGCCGGCGTC GGTCTTCACC GCCGACCCAC CCGATCGGCC GATCCGGGAC GACGACCCGC CGCCGTTCTG A
|
Protein sequence | MFVDDHWGGR PVRVWSGARP PARWGLPESL THGVVLPFRA SAARAAVRVR SWRETLATTG PGADLARVLE TRPDGPVDRA DGSTDCSAEA TTTSVPLWAN TLIDSLQARQ SLMAHLQARQ QADLAELSGC YPGLHEFLAT EIALALGIAE GTATRQLAEA VDLTTRLPET FAALDEGRIT PVKASTIRSH TQDLDLDQSA LVEADVLPAA PRGTVPELRH AVQRAVIRHD PHGADDRHQA QRERRRVSVK AQPDGMGSLW LLSTAQDVAT IQACLVAVGD AAASPDDGRT ADARRVDAMV DLCAEVLDSG QWRGTALPTR QRRRPHVQVT VPITALLDPA TTAGQSAELH GYGPITAAQA AQITADATLR RLVCDPLTGA LLDYGRTTYH PPAALADHVL VRDQTCRLPG CRQPAQRCEL DHVEPFRPGH DTGGTTSATN LCLVCKHHHR AKDGGEFLLR RTADGYDWTS PLGRRYRQPP TRLWEPPPEH PPERPASVFT ADPPDRPIRD DDPPPF
|
| |