Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2076 |
Symbol | |
ID | 8447686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2288563 |
End bp | 2289957 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645041199 |
Product | transposase IS4 family protein |
Protein accession | YP_003201444 |
Protein GI | 258652288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.181591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0238985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAGT GTACCGGGCT GTACCCGTCG CTCCGAGTCG ATGCCACCGG CAAGCGGGTG GTGTCACACG CCGGGTCGGT GCTGCTCGCC GTGACCGCGG ACAAGATTGG CATCGGGCGG GGTTTGTCCG CGGCGCTGGC ACCGTGGCGC AGGCCGATGG CGGTGCACGA CCCGGGCAAG ATCCTGCTGG ACCTGGCGAT CTCGCTGGCG ATCGGCGGTG ACTGCCTGGC CGACATCGCC CAGTTGCGGA CCGAACCCGC GGTGTTCGGG CACGTGGCGT CCGACCCGAC GGTGTCCCGG CTGATCGACA CCCTCACCGC CGACGCGGTG ACCGCGCTCA AGGCGATCGA CGCCGCGCGG GCCGCGGCCC GCGCCCGGGC GTGGAAGCTG GCTGGACCGG CCGCGCCGGA CCACGATCGG TCGGCGAAGG CGCCGCTGGT CGTGGACGTG GACGCCACCC TGGTCACCGC GCACTCCGAG AAGCAACTCG CAGCAGCAAC ATTCAAGAAG GGATTCGGGT TTCACCCGAT CGGCGCCTGG GCCGACCACG GACCGGGCGG CACCGGCGAA ACCCTGGCGA TGCTGCTGCG GCCAGGCAAC GCCGGATCCA ACACCGCCGC CGACCACATC AGCGTGGTCA AGGCCGCGCT CGCGCAGTTG CCCGCCTCGA GCGCGGCCCG GCAGCCCGGT CGCCGCGTGT TGGTCCGCAC CGACGGCGCC GGCGGCACCC ACGAGTTCGT GGCCTGGCTG ACCCGGCAAC GGGTGCAGTA CTCGGTCGGG TTCACCATGA CCACCGACAT CACCACCCAG GTCGACGCGC TCCCGGACTC CGCGTGGACC CCGGCGTACG ACGGCGACGG CAAGCCCCAC GGGGCGTGGG TGGCCGAGCT AACCGGGGTG CTCAAGCTGA CCGGCTGGCC CACCGGGATG CGGGTCATCG TCCGCGCCGA ACGACCCCAT CCCGGCGCTC AGCTCAAGTT CACCGATTCG AACGGCAACC GGCTCACCAC GTTCGCCACG AACACCGCCG GCGGGCAGCT CGCCGATCTG GAACTGCGGC ACCGGCGCCG CGCCCGCTGC GAGGACCGGA TCCGCAACGC CAAGGACACC GGCCTGAACA ACCTGCCCCT CAAGGACTTC ACGCAGAACC AGGTATGGAT CGAGGTCGTG CAACTGGCCA TCGAACTGAC CGCGTGGATG CAGATGCTCG CATTCACCGG CACCCCGGCC CGGACCTGGG AACCCAAGAA GCTGCGGCAC CGCCTGTTCA GCATCGCTGC CCGGATCGGC CGCAGAGCCC GCCGCACCTG GCTCCGCCTG TCCGCCCACG CACCCCACCG CGACCTGCTC CTGGATGGCC TGGGCCGGCT CCGGAGCCTG CCGCAATCAA CCTGA
|
Protein sequence | MRKCTGLYPS LRVDATGKRV VSHAGSVLLA VTADKIGIGR GLSAALAPWR RPMAVHDPGK ILLDLAISLA IGGDCLADIA QLRTEPAVFG HVASDPTVSR LIDTLTADAV TALKAIDAAR AAARARAWKL AGPAAPDHDR SAKAPLVVDV DATLVTAHSE KQLAAATFKK GFGFHPIGAW ADHGPGGTGE TLAMLLRPGN AGSNTAADHI SVVKAALAQL PASSAARQPG RRVLVRTDGA GGTHEFVAWL TRQRVQYSVG FTMTTDITTQ VDALPDSAWT PAYDGDGKPH GAWVAELTGV LKLTGWPTGM RVIVRAERPH PGAQLKFTDS NGNRLTTFAT NTAGGQLADL ELRHRRRARC EDRIRNAKDT GLNNLPLKDF TQNQVWIEVV QLAIELTAWM QMLAFTGTPA RTWEPKKLRH RLFSIAARIG RRARRTWLRL SAHAPHRDLL LDGLGRLRSL PQST
|
| |