Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4226 |
Symbol | |
ID | 8449852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4678090 |
End bp | 4681041 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645043275 |
Product | conserved repeat domain protein |
Protein accession | YP_003203504 |
Protein GI | 258654348 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.405356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGC CGATCGTGTC CGCACCGGTG GCCGCGGCCG CGCCGGGCCC GGTGTACGGG ACATTTACCG ACTTGGCCGG GACCGGCGGC GCCTACCGAG GGACGATGAC GCTGGCCCCG GGCTTCCCCG CGGCCACGTT CGCCTCCACC TCTCGTTCGG GCGGAGTCGG CCCGCAGTCC GGGACGACCG CCTGGTTGCC CGCCGGCAGC CCGCCCGGCG TCGTCTACGG ATCAAGCCAG AACCAGGCCT ACCTCAACCT GCGACCGGCA GCCGACAGCG CCGCCAGCCC GTCCGAGACC GTCTACACCT TCGCCCGGCC CACCCCGACG GCCGGCTGGA GCTTCATCCT CGGCGACATC GACGCCGACC AGGTGACGGT CAGCGCGACC ACCGCCGACG GCAGCGCCGT CCCGGTCGCG GCGTTGGGCT TTGCCGGAGC CTTCAACTAC TGCGACGCAT CGCCCCGGCC GAGCTCGTGC TCCGGGGTCG GCGCGCCGTA CGACCTGCCC AGTTGGGATC CGGGCACGGC CACGCTGACC GGCAACTCGG GGGCCAGCGA CACCACCGGC GCCGCCGGTT GGTTCAGCCC GACGGTGCCG ATCAAGACCC TGACGTTCAG CTACCGGTGG CGCAGCGGCG TCCCGGTGTA CCAGACCTGG TTCGCCACCC AGACCCGGTC GGTCTCGGGC ACCGTGACCG CCGGCGGCAG CGGGCTGGCC GGGGTCACCG TCGAGATCGT CGACGGCGGC GGCGCCGTGG TCGGCACCGT CACCACCGGG GCCGACGGCA CCTACGGCCG GGACGGCCTG GCCCCCGGCA CGTACACCGT CCGGGTCGTC ACCCCCGACG GGTACGCGCC GGTGGGGCCG AGCCGGCGCC CCGCCGACCT GAGCGCCGGC GACGCGACCA CCGTCGACTT CGCGCTCGCC CAGGTCGCCG ATCTCTCGGT CATCAAGAAG CTGGACACCG ATCCGGTCGT CGCCGGCGAG CCCATCACCT ACACGCTGAC CGCCACCAAC GCCGGCCCGG CGGACGCCAC CGGGGTCAGC GTGGTCGATA CCGTGCCGCC CGGGCTGACC GGGGTGAGCG GCCAGGTGAC CGGCGGTGCC GCCTGCGTGG TCGACGCCGC CCTGCTCACC TGCCCGGTCG GTGCACTGAT CGTCGGGACG TCGGCCACCG TGCAGGTCAC CGGCACTGTG TCGGCGACCG CCCCGGCGGG CGTCGCGCTG CTCAACCAGG CGGCCGTGAC CGCGGACCAG CCGGACCCGA ACCCCGGCAA CAACCGCGCG TCCGCGGCGG CACGGGTCAC CGCGGCGGCC GACCTGGTCC TGGTCAAGAC GTTCACCCCG GACAACCCGG TCGCCGGCGG GACGGTCAGC TACCAGCTCA CCGTCACCAA CAACGGACCG TCGCGGGCCA CCGGGCTGGC CATCGCCGAC CCGCTGGACC CGGGGGTGAC CGTCGGCACC GTCACCACGA CCGACGGGAC ATGCACCGCG CCCGGTGGGA TCGTCGCCTG CACCGTGCCG GCCCTGGACG TGGGCGACAG CGTGACGGTC ACGGTGCCGG TCACCCTGCC CACCGGGTCC ACCCCGGCGC TGCAGAACGC GGCGTCGGTC ACGGCCGTCA CGCCCGACCC GGATCTGGAA AACAACACCG GGGTGGCGAC CTTCGAACCC AGCCTCGGGG CCAACCTGGC CCTGATCAAG ACGGCCTCAC CGGCGACCGC CATCCCCGGG CAGACGATCC AATACCAGCT GTCCGTCTCC AACGGTGGCC CCTCCGACGC CCCGAACGTG CTGCTCACCG ATTCGATCCC GCTGGGCCTG GACGCGGTCA CGGTGACCGA CGCCGGCGGC GCCAGCTGCA CCGTCACCGA CCAGGTGAGC TGCAGCTGGG CCTCGGTGCC GGTCGAGGCG ACCCGGACCA TGACCCTGAC CGGGATCGTC GCCCCGGACG CTCCCGACGG CGCCCTGACC AATACTGCCG CGGTCACCGC ACCGGTCGAC GAGTCCGACC CCAGCGACAA CACCGCCACC ACGTCGGTGC TCATCACCTC CGCGGCCGAC GTCAGCCTGA CCAAGACCGC CGGCCCCGAC CCGGTGGCCC CCGGTGGCAC CGTGACCTTC ACCCTGACCG TGGGCAACGC CGGCCCCCAG CAGTCCGCGT TGCTGGAGCT GCGCGACCCG ACCCCGGCCG GACTGAGCAT CACCGCCGTC GACGACCCGG ATTGCCTGAC CGATGCCGTC GCGGTGACCT GCCTGATCGC CGGCCTGGAC CCCGACGCCA GCCGCACGGT GACGATCACC GGCACGCTGT CGCCGGACTA CGACGGCGAC GAGCTGACCA ACACCGCGCA GGTGGCCTCG CTGCTCACCG TCGACCCGGA CCCGGCCGAC AACTCGGCCA CGGCCACCGT CGCGGTGATC ACCCCCGAGC CGCCCGGCTC GAACCTGACC GTGAGCAAGA CGGCGACCAC GCCGACGGTT GGTCAGGGCG ACCCGGCCGG GTTCGTGGTC ACCCTGACCA ACCAGGGACC GGCCGACCAG ACCGACGTGG TCATCGCCGA CACCGCCGGC GACGGCCTGG TGATCGGCTC GGCCACCGGC TCCGCGGGCA CCTGGGACGG CGCCGCGGGC CTGTGGACCG TGCCGTCCCT GGCCGCCGGG GCCAGCGCCA CCCTCACCGT GTCCGCGACC GCGACCGCGG TCGGCACCCT GACCAACACG GCGACCCTGA TCAGCTCCGG CCGGCCCGAC ACCGACCCGG CCGACAACTC CGCGAGCGCC ACGGTCCAGG TCAACCCGAC GGCCGACCTG TCCCTGACCA AGTCGGTGAC GCCGGCCGGC GGGGCCCCCG GGCAGCCGGT CACCTACCAG CTCACCGCGA CCAACGCCGG CCCGTCTCCG GCCAGCGGGT GCACGTGGTG GACACGTTGC CGGCCGGGGT GA
|
Protein sequence | MALPIVSAPV AAAAPGPVYG TFTDLAGTGG AYRGTMTLAP GFPAATFAST SRSGGVGPQS GTTAWLPAGS PPGVVYGSSQ NQAYLNLRPA ADSAASPSET VYTFARPTPT AGWSFILGDI DADQVTVSAT TADGSAVPVA ALGFAGAFNY CDASPRPSSC SGVGAPYDLP SWDPGTATLT GNSGASDTTG AAGWFSPTVP IKTLTFSYRW RSGVPVYQTW FATQTRSVSG TVTAGGSGLA GVTVEIVDGG GAVVGTVTTG ADGTYGRDGL APGTYTVRVV TPDGYAPVGP SRRPADLSAG DATTVDFALA QVADLSVIKK LDTDPVVAGE PITYTLTATN AGPADATGVS VVDTVPPGLT GVSGQVTGGA ACVVDAALLT CPVGALIVGT SATVQVTGTV SATAPAGVAL LNQAAVTADQ PDPNPGNNRA SAAARVTAAA DLVLVKTFTP DNPVAGGTVS YQLTVTNNGP SRATGLAIAD PLDPGVTVGT VTTTDGTCTA PGGIVACTVP ALDVGDSVTV TVPVTLPTGS TPALQNAASV TAVTPDPDLE NNTGVATFEP SLGANLALIK TASPATAIPG QTIQYQLSVS NGGPSDAPNV LLTDSIPLGL DAVTVTDAGG ASCTVTDQVS CSWASVPVEA TRTMTLTGIV APDAPDGALT NTAAVTAPVD ESDPSDNTAT TSVLITSAAD VSLTKTAGPD PVAPGGTVTF TLTVGNAGPQ QSALLELRDP TPAGLSITAV DDPDCLTDAV AVTCLIAGLD PDASRTVTIT GTLSPDYDGD ELTNTAQVAS LLTVDPDPAD NSATATVAVI TPEPPGSNLT VSKTATTPTV GQGDPAGFVV TLTNQGPADQ TDVVIADTAG DGLVIGSATG SAGTWDGAAG LWTVPSLAAG ASATLTVSAT ATAVGTLTNT ATLISSGRPD TDPADNSASA TVQVNPTADL SLTKSVTPAG GAPGQPVTYQ LTATNAGPSP ASGCTWWTRC RPG
|
| |