Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2679 |
Symbol | |
ID | 8448291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2933177 |
End bp | 2934841 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645041771 |
Product | hypothetical protein |
Protein accession | YP_003202014 |
Protein GI | 258652858 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0000692051 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00469721 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGCGAC TGCACGTAAC GCAGATCGAG CGCAGGCTCG ACGATACCGT GAACAAGCAC ATCGACATGT CCGATGTCCC GTTCACCGGC GACCAGCTCC GAGCCTCATC GTTGAGTCGA GGTCTGGCTG CGTTCATCGT AATGAAGGTC GCCAACGTGG ATCCTCTGAC CGCAGCCAAT AGCGTGACGG ACGAGGGTGG CGATAATGGG ATCGATGCGG TTATTGCGCT CGCTCTTGAA CGCAGGATCG TAATGGTTCA GGCAAAGTGG GCTTCAGACA GTCGGGGGAG TGCAGAGAAA GCCGACATTC TGAAGTTCCG CAAAGGCGTG GACGATTTCG TATCAGGTGA TTGGTCCAAG TTTGGCCCGA AGTTTAACGC AAAGAAGTCT GAACTTGAGC CGTTGCTCTA CGACCCGTCT CTGAGAATTG AGATGATCTT CTGTCATTTG GGGACGGGCG TCCTCTCTGC CGAGTCGGCC CTTCTGATGG ACGAATACCT GGACGACATG AACAATCCAA CCGAGATCGG ATCGTTCCTG TATCTGAACC AGGGGCAGGT ACACCGAATG TTGGTTGACG ATGTCGTCAA GACCAAAGTA AACCTCGACG TGGAACTATC TGACTGGGGA ACGCTTGAAC AGGAGCCGAT TGCATTCTAC GGTCACGTCA GTGGCGAGAC AATCGCCGGG TGGTACATTG AGAACGGCAG CTCACTGCTA TCGCAGAACG TTCGTGTGCT GATTCCAGAT TCTGAAGTCA ACGATGGACT CGTGGCGACC CTAACGGACA CGCCCGGAAA GTTCTGGTAC TTCAACAATG GCCTGACAGT TCTGTGTGAT CGAATATCGA AGGCACCTCT TGGTGGCGCC GATCGTCGCT TAGGCCGCTT TACTGTCGAG GGTGCATCGA TAGTCAACGG AGCGCAGACC GCGGGCTCGC TTGCACGGTT CTCCGCTGCC GCTGGCGACC TCGCCGAGGT GAGAGTTCTC GTCCGTTTTA TCTCTCTCGA GAATTCTCCG GCTGACTTCG CGAAGGATGT TACTCGGGCG ACGAACACTC AGAACCGCAT CGGCGGCCGC GAATTTGTTG CGTTGGATCC AGAGCAGGCA CGACTGCGAG ACGAGTTTGC AGTCGCAGGT ATGACTTACG CGTTTCGAAC CGGTGAGGAG GCACCCTCTC CGTCCGATGG TTGCGACGTC GTGGAAGCCA CGGTGGGGCT TGCTTGTTTG CATAGCGCTC AATTGGCGAC GCAAGCCAAG AGGGAGATTA GTCGGTTGTG GGACGATATA TCACGGGCGC CGTATCGGAC TCTATTCAAT CCGTCAACCA ACTACGTGCG GGTTTGGCGG TCTATCCAGG TCCTTCGAAT GACAGAGGAT TTGCTCCGAA GGGCGCGAGA GAAACTCGAT GGCCGCGAGA AGCTGATAGC CACCCACGGA AATCGGGTAG TTCTGCACTT GCTGTTCCGT CGATGCGACA CTTCGTCAAT AGGGGATCCC GACCACGACT GGGAATCAGA GCTGGAAAAG ATCAAGGGAG AATTTGAGGG GATTCTTGAA AAGGTTTTCG CGATAGTTGA GGGGGAATTC CCGGGTTATC CTGCGAGCTT ATTCAAGAAC GCATCGAAGG TCCAGTCGTT GGTGTACCGA GTTCTCGAGA GCTGA
|
Protein sequence | MGRLHVTQIE RRLDDTVNKH IDMSDVPFTG DQLRASSLSR GLAAFIVMKV ANVDPLTAAN SVTDEGGDNG IDAVIALALE RRIVMVQAKW ASDSRGSAEK ADILKFRKGV DDFVSGDWSK FGPKFNAKKS ELEPLLYDPS LRIEMIFCHL GTGVLSAESA LLMDEYLDDM NNPTEIGSFL YLNQGQVHRM LVDDVVKTKV NLDVELSDWG TLEQEPIAFY GHVSGETIAG WYIENGSSLL SQNVRVLIPD SEVNDGLVAT LTDTPGKFWY FNNGLTVLCD RISKAPLGGA DRRLGRFTVE GASIVNGAQT AGSLARFSAA AGDLAEVRVL VRFISLENSP ADFAKDVTRA TNTQNRIGGR EFVALDPEQA RLRDEFAVAG MTYAFRTGEE APSPSDGCDV VEATVGLACL HSAQLATQAK REISRLWDDI SRAPYRTLFN PSTNYVRVWR SIQVLRMTED LLRRAREKLD GREKLIATHG NRVVLHLLFR RCDTSSIGDP DHDWESELEK IKGEFEGILE KVFAIVEGEF PGYPASLFKN ASKVQSLVYR VLES
|
| |