Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1906 |
Symbol | |
ID | 8447513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2099063 |
End bp | 2100883 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645041036 |
Product | Rhs element Vgr protein |
Protein accession | YP_003201284 |
Protein GI | 258652128 |
COG category | [R] General function prediction only |
COG ID | [COG3500] Phage protein D |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0194585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0014616 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGCCCA CCTCGTCCAG CTTCCTGGTC GACATCGACG GCTCGCCGCT GGCCGCCGAC GCCAAGGCAC TGCTGGTCTC GGCGATCGTC GACGACAGCC TGCGGCTGCC CGACTTCTTC CTGCTCCGCT TCCGTGATCC GGACCGGTTG GTGATCACCA AGTCCGGCGC CAAGATCGGG TCCAAGGTCA AGGTCAGCGT GGCCACCGAC GCCGCGCCCA GCCCGTTGCC GCTGATCGAG GGGGAGATCA CCGCGCTGGA GGCCGAGTAC GACGCCACCG GCACCTACAC CGTCATCCGC GGCTACGACC AGGCCAACCG GCTCTTCCGG GGCCGGCGCA CCGAGTCCTA CACCCAGAGC ACCGCCTCGG ACGTGGCCAC CAAGGTGGCC CAGCGGGCGG GGCTGTCCAT CGGCGAGGTC GAGTCGACCA GCACCGTCTA CGAGCACCTG TCCCAGGGCG GGGTCACCGA CTGGGAGTTC CTGGACGGCC TGGCCCGCGA GATCGGCTAC GAGATCGCCG TCAAGGACGG CAAGTTCGAC TTCCGCAAGC CGAAGAAGGC CGACACCGCC CCGGCCGGCG GCGGGGGACC GGAGCAGCAG AACCCGTTGG TGCTGCGGCT GGGTACCGAC CTGCTGCGGT TCCGGTCACT GATCACCGCG GCCGAGCAGG TCAAGGAGGT GCAGGCCCGC GGCTGGGACC TGGCCCAGAA GAAGGCGTTC GTGGCCACCG CGCCGGCCGC GACCACCTCG GCCGTGCTGC CCGCCTACAA CCCGGTCGAC ATCGCCAAGA AGTTCGGCGA TCCGGTCTAC GTGGCCACCG ATGTCGCCTA CCGCAGCCAG GCCGAGGTGG ACAGCGCGGC CGCGGCCATC GCCGAGCAGA TCGCCGGTGC CTTCGCCGAG TTCGAGGGTG TCGCCCGGGG CAATCCGAAA CTGCACGCCG GGGCGGCGAT CTCGGTGGAC AACGTGGGGG CCCCGTTCGA CGGGAAGTAC ACCATCACCT CCTCCCATCA TCGCTACGAC CCGAACACCG GCTACACCAC GATGTTCTCG GTCACCGGCC GCTCGGAACG CAGCCTCTAC GGGCTGGCCA ACGGCGGTGG TGGCGGCAAG CTCGGGCAGG GGCCGGTCGT CGCGCAGGTC AGCGACGCCA AGGACCCGCT GGAACAGGGC CGGGTCAAGC TGACCTTCCC CTGGTTGTCG GACACCTATG TCAGCGACTG GGCCCGCACC GTGCAGCCGG GGGCCGGCAA GGACCGGGGG GCGCTGGTGC TGCCCGAGGT CGGCGACGAG GTGCTGGTGC TCTTCGAGCA GGGCGACATC CGCCGGCCCT ACGTGCTCGG TGGCCTGTTC AACGGAGTGG ACACCGCACC CAAGGGCAAA CCCGACCTGA TCGACGGCAG CTCCGGAGCG ATCAACCGGC GCTCGTTCGT CTCCCGCCGC GGTCACCGCA TCGACCTGAT CGACGAGGAC GGCCGGACCG AGGGCATCAC GCTGTCCACC ACCGGCGACA AGCTGCAGCT CAAGCTGGAC TCGGTCGAGA CCAAGATCAC CGTGCACAGC GACGGCAAGA TCCTGATCGA GGGCAAGGGC GGCGTGCTGA TCGACTCGGC CAGCAGCAAG CTCGAACTCA AGGGCGGCGA GGTCTCGATC ACCTCCACCA GCGGGGTCAA GATCGACGGC GGCAGCGGTG GGGTGGACGT GCAGACCAAC GGCCAACTCT CGCTCAAGGG CAGCACCGCC AAGCTGGAGG GTCAGGCCAG CGCCGAGGTC AAGGCCAGCG GCGTGCTGAC CGTCCAGGGT TCCCTGGTCA AGATCAACTG A
|
Protein sequence | MAPTSSSFLV DIDGSPLAAD AKALLVSAIV DDSLRLPDFF LLRFRDPDRL VITKSGAKIG SKVKVSVATD AAPSPLPLIE GEITALEAEY DATGTYTVIR GYDQANRLFR GRRTESYTQS TASDVATKVA QRAGLSIGEV ESTSTVYEHL SQGGVTDWEF LDGLAREIGY EIAVKDGKFD FRKPKKADTA PAGGGGPEQQ NPLVLRLGTD LLRFRSLITA AEQVKEVQAR GWDLAQKKAF VATAPAATTS AVLPAYNPVD IAKKFGDPVY VATDVAYRSQ AEVDSAAAAI AEQIAGAFAE FEGVARGNPK LHAGAAISVD NVGAPFDGKY TITSSHHRYD PNTGYTTMFS VTGRSERSLY GLANGGGGGK LGQGPVVAQV SDAKDPLEQG RVKLTFPWLS DTYVSDWART VQPGAGKDRG ALVLPEVGDE VLVLFEQGDI RRPYVLGGLF NGVDTAPKGK PDLIDGSSGA INRRSFVSRR GHRIDLIDED GRTEGITLST TGDKLQLKLD SVETKITVHS DGKILIEGKG GVLIDSASSK LELKGGEVSI TSTSGVKIDG GSGGVDVQTN GQLSLKGSTA KLEGQASAEV KASGVLTVQG SLVKIN
|
| |