Gene Information |
Locus tag | Namu_2708 |
Symbol | |
ID | 8448320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2968485 |
End bp | 2971601 |
Gene Length | 3117 bp |
Protein Length | 1038 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645041800 |
Product | hypothetical protein |
Protein accession | YP_003202043 |
Protein GI | 258652887 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
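The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS.

    Assumes the CDS length counts the stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Namu_2708 record above.
length = gene_length_bp(2968485, 2971601)
print(length)                     # 3117
print(protein_length_aa(length))  # 1038
```

Under these assumptions the results match the Gene Length (3117 bp) and Protein Length (1038 aa) fields in the record.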
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00403632 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00987423 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCGACA TTCCCACCGC GCACCACGCC GGCCCGACGA CGCCCGGCGA CCCCGGAACG GACCCGGTCG GCTCCTGGGA GCTGGACGTC TACCGGCAGG TCGTCGCCGG ACCGGGCGCG GCCAAACTGC AGGACTCCCG GGTGGCGCCG CTGGTCGCCG CCGCTCGCGG CTACTGGCAC GCCGACCAGG CCGCCGAGAA GGTCTGCGCC GACCTGGCCG GGGTGAACTT CCGCGGCCGC ACCGGCGAAC AGCTGCGGGC CGCGCTGCGC ATGGGCTCGA TGGTGATGCC CCGGTACCTG CCCGCGTCGG GCCGGGTCTG GGAGTCATCG AGTGACGGGT CGGATGGGCG GCTCCGCCCG TTCCTGATCG AACTGAAGCC GGCCCACCCG CGGACCGAGA CCGACCGTGC CTCCGGCAAG GTCCGGGTGA TCAAGTACGA GACCCTGCGG CCGACCGGGT CCACTGCCGG CTCCATCATC GACGTGCACC CGGCCACCCC ACGCGACTGG GACGACGCGC CGGTCGCGCT CATCACCGAA GGAACCCTCA AGGGTGATGC CGCCCTCACC GCCCTGCTGC TGGCCGCCGG GATCACCCGC GACGACCTGT CGTTCCCCGC CTCGATGCCG CCGACCCAGC GGCAGGCGCT GGCCCGGCTG CGGGAACTGA TGGACCGGGT GCCCGCGGAC CGTCGCACCC TGATCCTGAC CGTCACCGGG ATCGCCAACT GGCACCACAA CCCGGAATGG GCCGAGGTGA ACCTGCGGGA CAAGCGGGTC TGGCTCACGC CGGACGGCGA CGTGCACCGC AACCCGCAGG TGTGGGACCA GGCCGACCAG GCGTGGACCT ACCTGTCCAG CCGACGCCGC GCCCAGGTCA AGCTACTGGA CCTGGGCGCG GCAACCGCGC CCGCCGCCCT GTCCGACGAC GACAGCGCCA ACATCGCCGT CACGCCCGAG GAAGAACCTG ACGACACCGA CGACGACGTT CCCGAACCCG ACGGTCCGGC CGCCGGGCCG AAAGGGCTGG ACGACTACCT CGGCCAGGCC GGCGGCTGGG ATACCCTCCC GCCGTTGCTG CGGCAGCAAC TGCCGCCCCG ACCGCGCGGC AAACGTCGAC CCACCGGCAC CCTCGAGGTC GACGAGAAGC GTTGCGTCGT CAAGGAATGG CACGACTCCG ACACGCGCAG CGGCAAACCC GCGGGCTGGC GGGTTAACAG CCACATCGCC GGCCGCGTCG TCGCCGTCAA CGACCAGCGC AACGCCTCCG CCCTGGAGAT CCGCACCGGA CTGATCGACC AGAGCGCCGA GGCCGAAAGT TCCGTGCGGG CAACCGTCGT CATCGAGGTC TACTGGATCG ACCAGGACGG CGACCGGGTC AGCCATCGGA TCACCGGTCC CGACTCCCTG CTGGTCGACA ACCCGGCCTC CTGGCACACC GAACGCGTCG GCGGAAACGT CCCCAGCGCT GTCAAGGCCC ACCCATCCTG GCCGTGCGAC ATGAAATGGC TGGCCGCGAT CAAGGCCCAC CGCCCCCGCG AGACGACCCG CCAGTCCCGG TGGGGACACA TGGGCTGGGT GCCCACCGAC GACGGCCACC CGGTGTTCCT GGTCGGCGCC CAGGTGATCG GCGCCGACGG GCTCATGCCG GCCAAGGCCA CCCCCGGCGT CAGCCCCGAC ATCCTGGCCA AGGCCAACCT GTTCGGCGTC ATCGAACCCG CCGACGACGA GCACCTGCGG ACCGCGCTGC GCAACGTCGT GCACGCCTAC GACCGGGTCT GGACCAACCC CCGGTACACT 
GCGATCGCGC TGGCCACCGC GCTCCGCCCC ATCGTCCCGC TGCCGTACAA CACCCCGGTG ATGTGCACAG GGGCGTCGGG CAAGGGCAAG AGTCACATGG CCAGCCTGAT CATGAGTTTC TGGCAGCACG CCCCCGGCAC CTGGAAGCCG TCGCTGCTGC CCGGGTCGAT GCTCGACACG ATGGCCAGCA CGCAGTACTC GCTGTCTCGA ACCCCGATCT GGGTGGTCGA CGACCTCGCC CCTTCCTCCG ACGATCGGCG GGCCAAGTCC AACGAGGCGC TGATGGAGGA CCTGCTGCGC GCGGTGCACA ACCAGGCCAA CCGGGCCCGC ATGATGATGA CCGCCACCGA CATGACCCAG CGCCGCGAAC AACCACCCAG GGCCGTCATC GTCGGCACCG CCGAGAACGA GGTGTCCACT AACTCCGCCG CCAACCGGGC CGTCAACCTG ACCTTCCAGG CCGGGTCGCT GCTCGGAAAT GAGGTGGATG CCGCCGACGA GATCCGCGAC GACACCGGCG AGGCCGCCAT GGTGACCTTC GGCGCGATCC GCGCCATGGC CCGCGTCGCC GGCGACAACT GGGAACTCGC CGCCGGACTC TGGCAGACCC GCAAGGACGA CCTGATGCGC GACGGCGTCG AACGGCTCGG CGCCGAGGGC AGCGCCCGCC GGCAGATGGA CATGGCCGCC GACCTGGCCA TCGGCCTGCT ACCCCTGGGC TACCTGGCGC AGATGCTCGG GCTCGAAGAG ATCCACCGAC GGGTCGACGG CTGGATCGAC GACATCTTCG ACCACGTCTT CGCCGAGACC CTGCGGATCC ACCAGAACAC CCCGGGCCGG GCCCTGCTCG CCGCGGTCCG CGCCGCACTG CGCGCCGGAG CCGCCTACAT CGAAGCCCGC GACACCCCCA CCCAGCCGCC CCTGCAGGAA GGCAAGGGGA CCGTGTCCGC GACCATGCTC GGCTGGCAGG CCGCCGGAGA CGGAGCCCTC CGACCCGGCG GCACCCGCAT CGGCAGCCTC GTCGCCAAGA ACGACCACCT GTACGTCCTG CTCGACCCGG CCAACGCGTT CAAGATCGCC CAACGCCACC AACCCGACCT GATCCCCTAC GGCTCCCGGC AAAACACCTC TTGGGCCGCC GTCTGGGCCG AAGGGCTCAC CTGCCCCGAC GGCGACCCGT GGAAGCGCCA TCAGGTTGGT CAGCATCTGC GACCGGTCGT CCGGGCCTAC GGCCTTGAAG CAGTGCCGTT CCCTCTTGAT CAGATCGTGG GTGGCGAAGT CGGTCTGGAG CGATTGGGGG ATCGCCAACC AAACTAA
|
Protein sequence | MTDIPTAHHA GPTTPGDPGT DPVGSWELDV YRQVVAGPGA AKLQDSRVAP LVAAARGYWH ADQAAEKVCA DLAGVNFRGR TGEQLRAALR MGSMVMPRYL PASGRVWESS SDGSDGRLRP FLIELKPAHP RTETDRASGK VRVIKYETLR PTGSTAGSII DVHPATPRDW DDAPVALITE GTLKGDAALT ALLLAAGITR DDLSFPASMP PTQRQALARL RELMDRVPAD RRTLILTVTG IANWHHNPEW AEVNLRDKRV WLTPDGDVHR NPQVWDQADQ AWTYLSSRRR AQVKLLDLGA ATAPAALSDD DSANIAVTPE EEPDDTDDDV PEPDGPAAGP KGLDDYLGQA GGWDTLPPLL RQQLPPRPRG KRRPTGTLEV DEKRCVVKEW HDSDTRSGKP AGWRVNSHIA GRVVAVNDQR NASALEIRTG LIDQSAEAES SVRATVVIEV YWIDQDGDRV SHRITGPDSL LVDNPASWHT ERVGGNVPSA VKAHPSWPCD MKWLAAIKAH RPRETTRQSR WGHMGWVPTD DGHPVFLVGA QVIGADGLMP AKATPGVSPD ILAKANLFGV IEPADDEHLR TALRNVVHAY DRVWTNPRYT AIALATALRP IVPLPYNTPV MCTGASGKGK SHMASLIMSF WQHAPGTWKP SLLPGSMLDT MASTQYSLSR TPIWVVDDLA PSSDDRRAKS NEALMEDLLR AVHNQANRAR MMMTATDMTQ RREQPPRAVI VGTAENEVST NSAANRAVNL TFQAGSLLGN EVDAADEIRD DTGEAAMVTF GAIRAMARVA GDNWELAAGL WQTRKDDLMR DGVERLGAEG SARRQMDMAA DLAIGLLPLG YLAQMLGLEE IHRRVDGWID DIFDHVFAET LRIHQNTPGR ALLAAVRAAL RAGAAYIEAR DTPTQPPLQE GKGTVSATML GWQAAGDGAL RPGGTRIGSL VAKNDHLYVL LDPANAFKIA QRHQPDLIPY GSRQNTSWAA VWAEGLTCPD GDPWKRHQVG QHLRPVVRAY GLEAVPFPLD QIVGGEVGLE RLGDRQPN
|
| |