Gene Information |
Locus tag | Namu_3521 |
Symbol | |
ID | 8449140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3865724 |
End bp | 3867805 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645042599 |
Product | hypothetical protein |
Protein accession | YP_003202835 |
Protein GI | 258653679 |
COG category | |
COG ID | |
TIGRFAM ID | |
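The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3865724, 3867805)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2082 693
```

Both values match the Gene Length and Protein Length fields of this record.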
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000399321 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.178114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCAG TTCAGGTCGG CCGGGCCGCC GTCGGCCGGG AGCCCGCTGC GGCTCCCGCT GCCGACGAGA CCGTTGATGC AGCGGCCGCG CCGCCGAGTG TCCTTGCCCC GCCGGCGATG CCGGGCATCG GGCCGGTCGC GGGACCGGGC TCCGCCGGTG CGCTGCGCCG CCACGGTGGG GAGCTCCACG ACCCGCTCGG CGGATCGCCG GTCAGCCCGG AGGTCAACGG GGCGTTGGCC CGGCTGCAGG GTCGCGGCCG CCCGCTGCCC GAGAGCGTCG CCGGGCCGAT GAGCCGGGCC ATGGGCGCCG ACCTGGGCGG CGTCCGGATC CACACCGGGG CCGAGCCGGC CCGGCTCGCC CGGTCGGTGC AGGCCACCGC GTTCACCCTG GGCCGCGACA TCTTTTTCGG CGCCGGCCAT TACGCGCCCC AGACCTCCGC CGGCCAACGG CTGCTGGCCC ACGAGCTGGC CCACACGATC GCGCCGGGCG GCGGGTCGGC ATCGGCATCG GGTCCGATCA TCGGGCGCGC CGCCGATCCC GTGGAGGCCC AGGCCGATCG GGTCGCCGAC GACGCCCTGC GGGTCCTGCG CCGCCAGCCC GCCGCGCCGG CGGCGCCGGA CGAGCACCCG GCGGCACCGG CGCCGCTGTC GCTGCTGCGC AGCCGGTCGG CCGGCGGTGA TCGATTGCGG CGCAAGGTGG GCTTCGAGGC CGAGCTGATG GTGCCGAGTC TGGGCCCGAG CGCCAACCAG CTCACCTACG CCAAGGAACC CGGCAAGGTC ACCGACTCGA TCAAGTCGTT CCTGGACGGC GGGGTGGCCT ACGGCACCGA CATCGGCGGC AAGGACAGCG GTGCCGACGT CCGGCTGGAC AGCGATCACG GCGCATCGGT CGACCGGCGG CCGATCGTGA ACAAGCTCAT CGAACTGGGC TACGTCACCG GGACACCGTC CGAGCCGCGG ACGAAGATCG AATTCGTCAC CACCGCCATG GACGAACTGG CCCCGGGGTC CACGCGCCGG GTCAAGGAGG TCGTGGGCAA GCTGCGCGGC CAGCTGAGCG CGGCCCTGAC CCAGGCGCAA AGTGGGGAAC TCCACCAGCT CGGGGCGCCG GCGAAGGCCG GGTACAAGAC GGGAGTGCCC GTCGCCGATC TGAAGGCCTG GCTGGGCGCG GACTACGCCG AACTGGATAC GGTGGTCAAG GAGTACCTGA CCGCTGGGGT CAAGGACGAG GTGTACCTGC AGGCGACCGT GGGGGTGATC CCGTCCTCGC TGATGACCTT CTTCGCTCGG GCGTCCCTGC CGGGCAAGGT CCAGGTCGCG CCGCCGTCGG CGGCCCGCCA GCAGATCCTG GGCATGGTCG CCGAGGTGGT GGCCGCGTTC GAGACCAAAT TCTCGACGGC TCCCGAGGAC CACTGGGTGC GCCAGCTCGG CGCCACCTCC AGCCATGCGT TCCTGGGCTT GTTGGGTCTG ATCTACAGCT ACCTGCTGGG TGACACGTTG CACCAGACGT CCGACGGGAC AGAGTCCACC GTCAAGAACG CGGTGCCGTT CCTGATCAAG ATGAGCCCGT ACGGGCTGGT GGCCAAGACC GCCCCGCACG CGCTCAAGGA CAACCCGCCA CCGCGCGAGT TCGTCCGGAG CATCGGTGAC CTGATGAAGA AGACCAAGTA CCTCCAGCTG GCCTACTGGG TCGAGGAGTC CCGCAAGGAC GGCACCGCCG TGAGTGACGG CAAGCTCCCG GCCAAGCTCG ACGCGCGCCC GCGCGCCGAG CGGCTGATCA CCGGCGACTA CACCGATTTC 
GTCGAGAAGG TGCTGCTGGG AACGGGCGGC GCGGTACCGG TCGTGGTCGG CAAGATGCTG CCGGGGCCGG ACAAGCCGCC GACCGATACG GGCGGGGTGA ATGTTTTCCA CGAGCTCTAC AACCAGCAGG GCATCCCGCT GGAGTACCGG GCCATCTCCA AGCGCTATAC GGTGTCCGAG GTCACGGCGG CCCTCGGCGA GATCCTCGCC GAGCTCCGCG TGATCAACCT GTCCGGTCTC ACCGAGGAAC AGCGGGCCAC CGTGACGGAG GCGTTCAAAT AG
|
Protein sequence | MRAVQVGRAA VGREPAAAPA ADETVDAAAA PPSVLAPPAM PGIGPVAGPG SAGALRRHGG ELHDPLGGSP VSPEVNGALA RLQGRGRPLP ESVAGPMSRA MGADLGGVRI HTGAEPARLA RSVQATAFTL GRDIFFGAGH YAPQTSAGQR LLAHELAHTI APGGGSASAS GPIIGRAADP VEAQADRVAD DALRVLRRQP AAPAAPDEHP AAPAPLSLLR SRSAGGDRLR RKVGFEAELM VPSLGPSANQ LTYAKEPGKV TDSIKSFLDG GVAYGTDIGG KDSGADVRLD SDHGASVDRR PIVNKLIELG YVTGTPSEPR TKIEFVTTAM DELAPGSTRR VKEVVGKLRG QLSAALTQAQ SGELHQLGAP AKAGYKTGVP VADLKAWLGA DYAELDTVVK EYLTAGVKDE VYLQATVGVI PSSLMTFFAR ASLPGKVQVA PPSAARQQIL GMVAEVVAAF ETKFSTAPED HWVRQLGATS SHAFLGLLGL IYSYLLGDTL HQTSDGTEST VKNAVPFLIK MSPYGLVAKT APHALKDNPP PREFVRSIGD LMKKTKYLQL AYWVEESRKD GTAVSDGKLP AKLDARPRAE RLITGDYTDF VEKVLLGTGG AVPVVVGKML PGPDKPPTDT GGVNVFHELY NQQGIPLEYR AISKRYTVSE VTAALGEILA ELRVINLSGL TEEQRATVTE AFK
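The sequences above are displayed in blocks of ten characters separated by spaces, so the spacing has to be removed before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used as a small example.
example = "ATGCGGGCAG TTCAGGTCGG CCGGGCCGCC"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))  # 30 76.7
```

Applying the same two functions to the full gene sequence should give a value close to the GC content field reported for this gene.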