Gene Information |
Locus tag | Namu_3706 |
Symbol | |
ID | 8449325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4069238 |
End bp | 4070242 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645042764 |
Product | integrase family protein |
Protein accession | YP_003203000 |
Protein GI | 258653844 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.000354013 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.710204 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTCG TGCCCTGTTT GCAGCGCCCG GCCGGGGAGG TGCCGCGGCT CGGGGAGGTG CTGTTGGACG AGTACCTGCG GTTCGTCGCG GCGCGGTGTC GGCCGAATAC GCTGCTGGCT CAGATGTTCG ACCTGAAGGT GTTCTTCACG GTCGTGGGGC GGCCGCCGGT GGAGGTGACC ACGGCCGACG TCCTTCGCTT CATCGAGCGG CAGCGGGCAG CCCGTAACGG CAACGTGGTC CGGCTCGCCG ATGGCGAGTC GGGGTTGGCG TTGACGACGA TCAAGCGGCG TCTCGCGACG GTCTCCGGCC TGTTCGAGTA TCTGGCCATT CGCGGGTTGG TGGCACGGAA CCCGGTGCCG CGCAGTCTGT CTGCTCGGCC GGGCCGGGCA CCTGTGCGAG GTGCTCCGTT GATCCGGGCG CCGCGCCGGC TGCCGCGGAT CCTGAGCCCG GCCGAGGTGA ACGCGCTGAT CAGCGCGCTG CGGACGGCCC GGGACCGGGC CATGGTGTGG CTGATGCTGC TCGGCGGCCT TCGCCGCTGC GAGGTCCTGG GCCTGCGGCA TCGTGACGTC CAACCGGGCG AGCGGCGAGT GTTCGTCACC GGCAAGGGCG GCCACGAGCG GGTCGTGCCG GTCGGGAAGG TGTTCTTCGC CGAGCTGGCC GGCTATTACG CCACGGAGCG ACCAGACACC GACACCGATC AGGTGTTCGT TGTGTTGAAG GGACAACGCC GAGGCCAGCC GCTGTCCGCG GCCGGGGTGG ACGAGGTGCT CTCCGGCGCC CGCCGGCGGG CCGGGCTCGC CCACGCCACC TGCCATGAGT TGCGCCATAC CTGCTTCACC CGGCTCCGCG AATCCGGGAT GGCGTTGGAG GCGATCCAGG CCCAGGCTGG GCACGTCTCG ATCGAGACCA CCAAGATCTA CCTGCATCTG GCCCCGGACT GGCTGGTCGA CGAGTACCGA AAGGCGATGG ACATCCTCGA CGACATCGCG GGAGCTCAAG GATGA
|
Protein sequence | MEFVPCLQRP AGEVPRLGEV LLDEYLRFVA ARCRPNTLLA QMFDLKVFFT VVGRPPVEVT TADVLRFIER QRAARNGNVV RLADGESGLA LTTIKRRLAT VSGLFEYLAI RGLVARNPVP RSLSARPGRA PVRGAPLIRA PRRLPRILSP AEVNALISAL RTARDRAMVW LMLLGGLRRC EVLGLRHRDV QPGERRVFVT GKGGHERVVP VGKVFFAELA GYYATERPDT DTDQVFVVLK GQRRGQPLSA AGVDEVLSGA RRRAGLAHAT CHELRHTCFT RLRESGMALE AIQAQAGHVS IETTKIYLHL APDWLVDEYR KAMDILDDIA GAQG
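The length fields in this record can be cross-checked against its coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon (the gene sequence above ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4069238
END_BP = 4070242


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1005 334
```

Both results agree with the Gene Length (1005 bp) and Protein Length (334 aa) fields listed under Gene Information.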
|
| |