Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1329 |
Symbol | |
ID | 8446925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1458918 |
End bp | 1461923 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645040462 |
Product | hypothetical protein |
Protein accession | YP_003200721 |
Protein GI | 258651565 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.031741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGGA TCCGCGCCCG CTGGACCGGT TCCACCCGGT CCGTCCGGTC CACGGTGGTG ATCGTGGTGC TGACCCTGCT GGCCTCGGTG CTGGTGGTAC TGGGGGTCAA CGCCCAGGGC TACCCGGTGA CCGACGTCAA CCTGGCGTCC TCGACGGTCT GGGTGACGAA CGAGGCCAAG GGCGTGGTCG GCCGGGTGAA CCGGCAGATC GACGAGCTCA ACTCGTCCGT CAAGGCCAAC AAGCCCGGTT TCGACGTGCT GCAGGACGGC GACGCCGTGC TCGTGGTCGA CCGGATCAAG AACGAGGTCC GGCCGGTCGA CGTCGCCGCG GTGGTGCTGA CCTCGCGGAT CGGCCTGCCG GAGAACGCCT CCGTCGGATT CGGCGGCGGC ACCCTGGCCG TGGCCGACCC GGTCACCGGC CAGCTGTGGG CCGGCACCCC GGACTCGCTG ACCGGCCTGG ACCCGGCGTC GACCGAGCCG CTGCTGACCG CGGGGGCCGG GGCGCGGGTG GCCGTCTCGG TCGGCGGGAC CGTCTTCGCG GTCGCCCCGG GCAGCGACGC GCTGTGGTCG GCGCCGATCG GGGACACCGG CGAGGTGACC GGCTGGGCCG TCGACGACGC CGGCGCCCCG GTCGCCCCGG AACCGGCCCG GCTGCCCGGC GGCCCGCTGA CCCCGGTGGC CGCCCAGGTG GTCTCGCAGC AGGCGCCGAC GGTGGATCTG ACCGCGGTCG GCGATGTCCC GGTGGTACTG GATCGGGCGA GCGGGGAGCT GATCCTGCCC GACCGGCGGA TCGCGATCCC CGGCGGCGCG GAGGCGGTGC TGCAGCAGCC GGGACCGGCC GCCGACGCGG TGCTGGTCGC CACCGGGACG GCGTTGCTGC GGGTGCCGCT GACCGGCGGC GACCCGCAGA CCGTGGTGGA CGGCGTCACC GGCACCCCGA GCGCGCCGGT GGCGCTGGAC GGCTGCGCGC ACGCGGCCTG GGCCGGCCAG CAGCCCCGTT ACGCCGTCGC CTGCGGGGAC GACCCGGCCC GGCTGATCGA CGTGCCGAAC GCGGCCGCGG CCGCCCGGCT GGTCTTCCGG GTGAACCGGA ACGTGATCGT GCTCAACGAC CAGGCCACCG GCGACATCTG GCTGGTCGAC GCGGACATGA AGCTGATCAG CAACTGGGAC GACGTGCAGC GTCAGGACTC CAGCAGCGAT CAGACCTCGC CGGACGGCGA GCAGAACAGC TCCGACCAGC TCACCAACTC CCGCACGGAC TGCGCGGCCA CCTCCGCCGC GGGCACCCCG CCGCAGGCCG CGGCCGACGA GTTCGGGGTC CGCGCCGGCC GGACCACGGT GCTGCGGGTG CTGGACAACG ACTCCTCGGC CGACTGCTCC ATGCTGGCGA TCAGCGCGGT CGCCCCGCTG CCGGCCGAGC AGGGCACGGT CGCCGTCGCC GAGGGCGGTC AGGCGCTCCA GCTGACCGTG CCGGCCACCG CCACCGGGCC GCTGCCGACC ATCGACTACA CCGTCGACGA CGGCCTGGGC CGCACCTCGA CGGCGCAGGT CAGCGTGAGC GTGGCCGCCG CCGACGACGT GCGGGCGCCC CGCAAGCTGC GCGACTCGGC GACCCTGGTC GAGGTCGGCG GCACGGTCTC CTACAACGTG CTGCCCGACT TCCGCTCGCC GGTCGGCGAG GATCTGTCGC TGATCTCGGC CACCGCCACC ACCGACGACT CGGTGACCTT CCAGCCCAAC GGCCTGGTCA CCTTCCGGGA CACCGGATCG GCCGGCGCGA TCAAGAAGAC CGTCGACCTG GTGATCTCCG ACGGCACCAG CCAGATCAAC GGCGCCCTGA CCATCGACGT CAAGGCCGAG GGCACCACCA CCCCGGTGCC CGGCCCGGTC GCCGCCAGCG GTGTCGTCGA CGAGGCGGTC ACCGTGAACC CGTTGCGCAG CGTGCTGTCC GGGCTGCGCG ACCCGGCCCG GGTCACCGCG GTGCAGCCGC TGGTGGCGGC CACCCCGGAC CGGCCGGCAT CCGCGAGCGC CACCCTGAAC CCGCTGGATT CCACCGTCGT GCTCACCGGG ACCGCCCCGG GCACCAGCTA CTTCCTGTAC ACGGTGGTCG CCGGCTCGGC CAGCGCCACC GGGGTCATCC GCTTCGACGT GACCGCCCCG CCCGACACCC CGGCCCCGCC CGTGGTCACC CCGGACGTCG GCTACCTGGT GCCCGGCCGG ACCCTGGTGC TCGACCCGCT GGCCAACGAC CGCGACCCGA TGGGCGCGGT GCTCTCGATC CAGCAGCTGA GCCAGCCGGC GGACAGTCCG CTGACCGCGA CCGTGCACGA CCTGAGCCTG CTGCAGGTCT CCTCCGCGCG CAGCGTGCCG GCCACCGGCG TCACCCTGAC CTACACCGCG GCGAACACGG CCGGGTCGAC GACCGGCCAA ATCCGGGTGA TCCCGGTGCC CGCGCCGCTG ACCCCGCAAC CCCCGGTCGC CGCCGACATC GCGGTGACGG TGCGGGCCGG GGACGCGGTG ACCGTGCCGA TCTCCCGGTA CGCCACCGAC CCGACCGGCC AGGTGCTGAC GGTCAAGCCG TTCCCCGACG GCACCCTGCC GGCCGACCAG GGCCTGGTGT TCGCCACCGA ATCGGCGATT CGCTACCTGG CCCCGGCCAC CCCGCCGCCG ACCTCCGTCC GGTTCAGCTA CACGGTCGTC AACACCGATC AGCTCACCGA CACCCGGTCG GTCACCATCT CGGTGCTGCC CGCCGATCCG GCAATCAACA CCGCCCCGCA GACCCCGCCG CTGACCACCG CGCGGGTGTT CGCCGGCCGC ACCGCGACGA TCCCGCTGCC GTTGGACGGG CTCGACCCGG ACGGCGACTG GGTCACCTTC GCCGGGCAGA CCGATCCGGC GCCGACCCTG GGCCGGGTGG ACAAGGCCGG CCCGGCCACC CTGACCTACA CCGCGCTGGG CGCCCCCGGC CTGGACGCCG CCGGCTACCT GGTCACCGAC CCGTAA
|
Protein sequence | MRRIRARWTG STRSVRSTVV IVVLTLLASV LVVLGVNAQG YPVTDVNLAS STVWVTNEAK GVVGRVNRQI DELNSSVKAN KPGFDVLQDG DAVLVVDRIK NEVRPVDVAA VVLTSRIGLP ENASVGFGGG TLAVADPVTG QLWAGTPDSL TGLDPASTEP LLTAGAGARV AVSVGGTVFA VAPGSDALWS APIGDTGEVT GWAVDDAGAP VAPEPARLPG GPLTPVAAQV VSQQAPTVDL TAVGDVPVVL DRASGELILP DRRIAIPGGA EAVLQQPGPA ADAVLVATGT ALLRVPLTGG DPQTVVDGVT GTPSAPVALD GCAHAAWAGQ QPRYAVACGD DPARLIDVPN AAAAARLVFR VNRNVIVLND QATGDIWLVD ADMKLISNWD DVQRQDSSSD QTSPDGEQNS SDQLTNSRTD CAATSAAGTP PQAAADEFGV RAGRTTVLRV LDNDSSADCS MLAISAVAPL PAEQGTVAVA EGGQALQLTV PATATGPLPT IDYTVDDGLG RTSTAQVSVS VAAADDVRAP RKLRDSATLV EVGGTVSYNV LPDFRSPVGE DLSLISATAT TDDSVTFQPN GLVTFRDTGS AGAIKKTVDL VISDGTSQIN GALTIDVKAE GTTTPVPGPV AASGVVDEAV TVNPLRSVLS GLRDPARVTA VQPLVAATPD RPASASATLN PLDSTVVLTG TAPGTSYFLY TVVAGSASAT GVIRFDVTAP PDTPAPPVVT PDVGYLVPGR TLVLDPLAND RDPMGAVLSI QQLSQPADSP LTATVHDLSL LQVSSARSVP ATGVTLTYTA ANTAGSTTGQ IRVIPVPAPL TPQPPVAADI AVTVRAGDAV TVPISRYATD PTGQVLTVKP FPDGTLPADQ GLVFATESAI RYLAPATPPP TSVRFSYTVV NTDQLTDTRS VTISVLPADP AINTAPQTPP LTTARVFAGR TATIPLPLDG LDPDGDWVTF AGQTDPAPTL GRVDKAGPAT LTYTALGAPG LDAAGYLVTD P
|
| |