Gene Information |
Locus tag | Namu_1886 |
Symbol | |
ID | 8447493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2071827 |
End bp | 2073083 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645041016 |
Product | Integrase catalytic region |
Protein accession | YP_003201264 |
Protein GI | 258652108 |
COG category | |
COG ID | |
TIGRFAM ID | |
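The coordinate and length fields in this record agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Namu_1886.
length = gene_length_bp(2071827, 2073083)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Both results match the Gene Length (1257 bp) and Protein Length (418 aa) fields listed in the record.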
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0003225 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0562744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCTC GGGCCGAGAT CACCACCAGG TACGCCAAGG CGTTCAAGGC TGCGGACAGG CGGACGAAGG GCCGGATCCT GGACGAGGTG GTGTCCGTGA CGGGCTGGTC GCGGGACAAC GCCCGGCGAC GCTTGACCAG CGCCGCGCAG TGCCCGCCGG GCGGCGGCCG ACAGGTCGCC CAGCGGCCCA GGAAGCAGCG GGCGAACAAG TTCTCCTACG AGGCCGTAAA GGTCCTCCAG CGGGTCTGGG CGGCTTCCGG TGGGCAGTGC GGCAAGTACC TGGCCGCATC GATGGACACG CAACTGGACG GGCTGGAACG GCACGGGGAG CTGGTCGACG GCGAGTGCCG GTACAGCGCT TCGGTGCGGG CCGAGCTGCT CGCGATGTCG CCGGCGACGA TCGACCGCTA CCTGCGGACC GCGAAGGCCA CCGACCAGGT CCGCGGTGTC TCGACCACGA AACCGTCACC GTTGCTGCGG TCCTCGATCA AGATCCGTAA GGCGGGCGAC GAGGTCGAGG CCGAGCCCGG CTTCTTCGAG GGCGACACGG TTGCACATTG CGGCCCGACC CTGCGGGGCG AGTTCGCTCG CTCGGTGAAC CTGACCTGTG TGCATACCGG GTGGGTCTTC ACCCGCTCGA CGCGCAACAA CGCGCACGCC AACATCCTGG CCGCGCTGCA GGCCGGGGTG CAGGAGATCC CTTTCGCGGT CACCGGTCTG GACTTCGACA ACGGAGGTGA GTTCCTGAAC CGGGCCGTCA TCAAATGGGC CGCCGAGCGA GACATCTACT TCACCCGGTC CCGGCCGTAC AAGAAGAACG ACCAGGCCAC GATCGAGTCG AAGAACAACC ACCTGGTCCG CCGGTACGCG TTCTACTACC GGTACGACAC CGACGAGGAG CGGCACGCGT TGAACCGGCT CTGGAAGCTG GTCAACGACC GGCTCAACTA CCTCACCCCG ACGATCAAGC CGGTCGGCTG GGGTGAGAAC AAGGCCGGTC GCCGCAAACG CCTGTACGAC AAGCCGCAAA CCCCGTTGAG TCGGCTGCTG GCCGCCGGCA CGCTGTCGCC GGCGCAAGCC CACGAGCTGA CCGCCTACCG GGACGGGCTC AACCCAGCCG CGCTCGCCCG TGAGATCGCC GACATTCAAG CCGTGCTGCT GGGCCTGGCC AAGAACAAGA CCGAGCAGCT CTACCTCGCG ACCGTCCCCA AGGCACTGCC CGACGTGCGC AAAGGCGTCC GGATCCGGGC CGGCTGA
|
Protein sequence | MASRAEITTR YAKAFKAADR RTKGRILDEV VSVTGWSRDN ARRRLTSAAQ CPPGGGRQVA QRPRKQRANK FSYEAVKVLQ RVWAASGGQC GKYLAASMDT QLDGLERHGE LVDGECRYSA SVRAELLAMS PATIDRYLRT AKATDQVRGV STTKPSPLLR SSIKIRKAGD EVEAEPGFFE GDTVAHCGPT LRGEFARSVN LTCVHTGWVF TRSTRNNAHA NILAALQAGV QEIPFAVTGL DFDNGGEFLN RAVIKWAAER DIYFTRSRPY KKNDQATIES KNNHLVRRYA FYYRYDTDEE RHALNRLWKL VNDRLNYLTP TIKPVGWGEN KAGRRKRLYD KPQTPLSRLL AAGTLSPAQA HELTAYRDGL NPAALAREIA DIQAVLLGLA KNKTEQLYLA TVPKALPDVR KGVRIRAG
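The gene and protein sequences can be cross-checked by translating the nucleotide sequence with translation table 11, as listed in the record. The following is a minimal sketch in plain Python, not the method used by the source database. It assumes the sequence contains only A, C, G and T once the space-separated 10-base blocks are joined, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard table, differing only in which codons may serve as start codons. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G at each position, third base fastest.
BASES = "TCAG"
# Amino-acid assignments shared by translation tables 1 and 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Join whitespace-separated sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1; stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCGTCTC GGGCCGAGAT CACCACCAGG"))  # MASRAEITTR
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` would allow the same comparison over the whole protein.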