Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2953 |
Symbol | |
ID | 8448566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3235007 |
End bp | 3237829 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645042038 |
Product | DSH domain protein |
Protein accession | YP_003202280 |
Protein GI | 258653124 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000014502 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00117174 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCCAGG ACAGTCGAAC ACCGCACGAG GACGGGTCAC CGGCGGATCG GTACGCCCGC GCCCAGGTGG CGCAGGCGGC CCGCCGCCAG TACCCGAAGC TGGCCGAGTT CGTGCTGACC CGGCCGTTCG CGCTCGATCC CTTCCAGGTA CAGGGGTGCC AGGCGCTGGA GGACGGCCGG GGCGTGCTGG TGTGCGCGCC GACCGGTGCC GGCAAGACGG TCGTCGGTGA GTTCGCGGTG CACCTGGCGC TGGCCTCGGG CGGCAAGTGC TTCTACACGA CCCCGATCAA GGCGCTGTCC AACCAGAAGT TCGTCGATCT GATCGCCCAG TACGGACCGG ACCGGGTGGG CCTGCTGACC GGCGACACCT CGGTCAATTC CCATGCGCCG GTGGTGGTGA TGACCACCGA AGTGCTGCGC AACATGCTCT ACGCCGGATC CCCGGACCTG GCCGAGCTCA CCCACGTGGT GCTCGACGAG GTGCACTACC TGGCCGACAA GTTCCGCGGG CCGGTGTGGG AGGAGGTCAT CCTGCACCTG GCCGCCGACG TGGCCGTCGT CGGCCTGTCG GCCACGGTAT CCAACGCCGA GGAGTTCGGC GCCTGGTTGG CCGAGGTGCG CGGCGAGCTG GCCGTCGTCG TCGACGAGGT GCGGCCGGTT CCGCTGTGGC CGCACATGAT GGTCGGGCGA CGGCTGTTCG ACCTGTTCAG CGTGCGCGAC CAGGAGGCCG GCCCGTCCGA CCCGCCCGGC TCGGGGCAGC TGCGGATCGA CCCGGCCCTG ACCCGGGCCA TCCACGACGC CGAGGCGCTG GCCGACCGTT TCGGCGGCGG CGGGTCACGG GTGGGCCGCC GGGGGGAGCG GGGACGGCCG CCGGGCGGCC CGCGCTGGCG TCCGCCGAAC CGGGTCGACG TCATCGAGCG GCTGGACATG GCCGGGCTGC TGCCGGCCAT CACCTTCATC TTCTCCCGGG CCGGGTGCGA CGCCGCGGTC GCCCAGTGCG TCCGCTCCGG GCTGCGGCTG ACCACCGAGC ACGAGCGGGA CGAGATCCGG CAGATCGTCG ACCGGCGCAC AGTGGAGCTG CTGGACGCCG ACCTCGGGGT GCTGGGCTAC TGGGAGTGGC GCGAGGGCCT GGAACGAGGG GTGGCCGCGC ACCACGCCGG CCTGCTGCCG GTGTTCAAGG AAACCGTCGA GGAGTTGTTC GTCGCCGGCC TGGTCAAGGC CGTCTTCGCC ACCGAGACGC TGGCCCTGGG CATCAACATG CCGGCCCGCA CGGTAGTGCT GGAGAAGCTG GGCAAGTTCA ACGGGGAGAG CCACGCGGAT CTGACCGCCG GGGAATACAC GCAGCTGACC GGCCGGGCCG GCCGCCGGGG CATCGACGTC GAAGGCCACG CGGTGGTGCT GTGGTCGCCG GGAATGGACC CGCGGGTGGT GGGCGGGCTG GCCTCGCGGC GCACCTACCC GCTGCGCTCC TCGTTCCGGC CCAGCTACAA CATGGCGGTC AACCTGGTCG ACCGGCTCGG CCGGCAGGCG GCCCGGGCAC TGATCGAGCA ATCCTTCGCC CAGTACCAGG CCAACGGCGC GGTGGTCGGC ATGGCCCGTC AGGTCTCCCG CAACACCGAG GCCATCGCCG CGCATCAGCG AACCATGCAG TGCCATCTGG GCGACACCGC CGAATACCTG GGCCTGCTGA CCGAGCTGGC CGATGCGGAA CGGGAGATCG CCCGGGCCGG GGCCGCCCGC CGGCGCGACG CCACCGCCCA GGACCTGGCC GAACTGCGGC GCGGCGACGT GATCGAGGTG CCGACCGGGC GGCGGTCCGG CCTGGCCGTC GTGCTCGATC CCGGGGTCGA TCCGGACGGT TCGGCCCGGC CCCTGGTGGT CACCGCCGGC CGCTGGGCCG GCCGGCTGTC CGCCGCCGAC TTCCGCGGCC GGGTGCCCGC CTTGGGCCGG GTCAAGCTGG GCAAGTTCAC CGACCACCGT TCACCCAAGG TCCGGCGGGA CCTGTCCTCG GCGATCGCCT CCTCCGGTAT CCGCGCGCCC GGCCGGGACC GGCGGGCGGC CCGCGGGCAC GACGCCGCCT CGGCCGACGA GATGGACCTG ACCGTGCTGC GCAAGGCGAT CCGGGCGCAT CCGGTGCACG GCTGCAGCGA CCGCGAGGAG CACCTGCAGT GGGCCCGCCG GTGGCGCCGG CTGATCGCCG AGAACGAGGC CCTGGCCGCC AAGGTTGCGG CGGCGACCGG ATCCCTGGGT CAGGCGCTGG ACCGGATCGT GCGCCTGCTG ACCGACGAGG GGTACCTGGA CGGCGACGCG CTGACCGACG ACGGCCGGAT GCTGGCCCGG ATCTGGTGCG AGAGCGACCT GGTGGTGGCC GAGTGCCTGC GCCGCGGCAC CTGGACCGGC GCGTCCCCGC CGGCGCTGGC CGCGGCGGTG TCCTGCCTGA TCTTCGAATC GCGCCGCGAC AACCCGGGCA TGAGCCGGAT CGCCGTCGGC GAGATCGGCG ACCTGGTGTC GGCGACCGTG GACGTGTGGG CCCGGATCGC CGGTGCGGAG CGGGAGATCG GCCTGCCGGC GACCCGCGAC GTCGACCCCG GGTTTGCCGC CGCGGTCGCC GCCTGGTGCC GGGGGGCGTC GCTGGCCGAG ACCCTGACGG TGGCGGTCAG TGGCGGAACC GACATCTCCG CCGGGGATTT CGTGCGCTGG TGCCGGCAGG TGGTCGATCT GCTCGACCAG ATCGCGGGCG TCGCGCCGGC GCCGGTGGCC GCGATCGCCC GGTCGGCGGT CGGATCTTTG CGGCGCGGCG TGGTATCCCT CGGCGCCGCC TGA
|
Protein sequence | MTQDSRTPHE DGSPADRYAR AQVAQAARRQ YPKLAEFVLT RPFALDPFQV QGCQALEDGR GVLVCAPTGA GKTVVGEFAV HLALASGGKC FYTTPIKALS NQKFVDLIAQ YGPDRVGLLT GDTSVNSHAP VVVMTTEVLR NMLYAGSPDL AELTHVVLDE VHYLADKFRG PVWEEVILHL AADVAVVGLS ATVSNAEEFG AWLAEVRGEL AVVVDEVRPV PLWPHMMVGR RLFDLFSVRD QEAGPSDPPG SGQLRIDPAL TRAIHDAEAL ADRFGGGGSR VGRRGERGRP PGGPRWRPPN RVDVIERLDM AGLLPAITFI FSRAGCDAAV AQCVRSGLRL TTEHERDEIR QIVDRRTVEL LDADLGVLGY WEWREGLERG VAAHHAGLLP VFKETVEELF VAGLVKAVFA TETLALGINM PARTVVLEKL GKFNGESHAD LTAGEYTQLT GRAGRRGIDV EGHAVVLWSP GMDPRVVGGL ASRRTYPLRS SFRPSYNMAV NLVDRLGRQA ARALIEQSFA QYQANGAVVG MARQVSRNTE AIAAHQRTMQ CHLGDTAEYL GLLTELADAE REIARAGAAR RRDATAQDLA ELRRGDVIEV PTGRRSGLAV VLDPGVDPDG SARPLVVTAG RWAGRLSAAD FRGRVPALGR VKLGKFTDHR SPKVRRDLSS AIASSGIRAP GRDRRAARGH DAASADEMDL TVLRKAIRAH PVHGCSDREE HLQWARRWRR LIAENEALAA KVAAATGSLG QALDRIVRLL TDEGYLDGDA LTDDGRMLAR IWCESDLVVA ECLRRGTWTG ASPPALAAAV SCLIFESRRD NPGMSRIAVG EIGDLVSATV DVWARIAGAE REIGLPATRD VDPGFAAAVA AWCRGASLAE TLTVAVSGGT DISAGDFVRW CRQVVDLLDQ IAGVAPAPVA AIARSAVGSL RRGVVSLGAA
|
| |