Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2172 |
Symbol | |
ID | 4599079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2324513 |
End bp | 2326867 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639776774 |
Product | MMPL domain-containing protein |
Protein accession | YP_923367 |
Protein GI | 119716402 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACCT TCCTGTACCG GCTCGGAAGA ACCGCGTTCG GCAAACCGTG GCTGTTCGTC GCCGGCTGGG TCGCAGTCCT CGCGGTGGTC GTTGGTGGCA TGGCAATCAA CGGGGTAAGC GTCAGCTCCG AGATGAAGAT CGAGGGCACC GAGGCCCAGA CCGTGCTCGA CCGCGTGGCC GATGAGCTCC CCGAGGCCTC GGGAGGCCAG GCCAGCGTGG TCTTCACCGT GCCGGACGGC GAGCGCCTCG ACACTCCGGA GCGACTCGCG GTCATCAGCG GCACCGTCAG CGACGTCTAT GACCTCGAGA AGGTCGTCAA CCCCCTCGAC GCCGCGCTGG GTGCCGCCGA GCAGGGAGGA CCGGGCACCC CCCAGGAGAA TGCACCGGGC GATCCCCCAG CCGGGTCGGA CCAAGAACCG GCACAGGAGC AGGGGCCTCC GTACCAGCCG CTGCTGGTGG ACGGGGAACC GGTGCCCGGC GTGCTGGTGT CCTCGGACGG GCAGGTCGCG CTGTTCCAGT TTCAATTCAC GGTCGCCGCA ACCTCACTGA CAGATGACGA CGTCACCTCG GTGGTCGAGG TGGTGGAACG CGCCGAGCAG GGAACGGGGA TGACCGTTCT ACCGAGCGAC TCGCTCAAGG CCCTCGAGAT CCCCATCGGC ATCGGCGAGG TGATCGGTCT CGCCGTCGCC GCCCTCGTGC TGGTGCTCAC CCTGGGTTCC CTTGTCGCCG CCGGCCTGCC CCTGATCACC GCACTGGTCG GCGTCGGCAT CGGCGTGGGC GGCGCATACG CGCTCTCGAA CGTCGTCGAG ATGAACTCCG CCACTCCCGT CCTCGGCCTC ATGGTCGGGC TCGCCGTCGG CATCGACTAC GCACTGTTCG TCGTCAACCG GCAGCGACGA CTGATCCTCG ACCAGGGACT CACCGCTCAG GAGGCAGCCG GCAGAGCAGT CGGCACCGCG GGCAGTGCCG TGTTCTTCGC CGGCCTGACC GTCCTCATCG CGCTCACCGC GCTCACCGTC ATCGGCATCG CAGTGCTGTC CACGATGGCA CTGGTCGCCG CGTCCACGGT GGCCCTGGCC GTCCTCATCG CCCTGAGCCT TCTGCCGGCG CTGCTCGGTC TGGTCGGGGA GCGGATCTGC TCCGACAAAG CCCGAGCCCG ACGCCGCACC AAGGTCGAGG CCGAGTCGCA CGGCGTCGCT GACCATTGGG TCAAGGGTGT GATCAGGTTC AGGTGGCCTG TCATCGCCGG TGTGGTCGCG ATCCTGGGCG TGATGGCGAT CCCCGCAGCC AGCATGCACC TGGGGATCCC TTCCGGCGCG ACCGCCAACC AGGACACAGC CGCCCGCCAG AGCTACGAGG CAGTGTCCCA AGGCTTCGGC GAGGGATTCA ACGGCCCCCT CCTGGTCACC GCCGAGCCCG TCGGCACCTC AGGCCGCGTC ACGCCCGAGC TGACCGCGAA ACTAATCGGC GAGTTCCAGG ACCGAGGCGA CATTGTGCTG GCCGCCCCCG TTGGCGTCAA CGAGGCTGGC GACCTGGCTG TGTTCAGCAT CATCCCCGCC TCCGGACCCG ATGACGAGGC CACCAGCGAC CTCGTCAAAT CGCTACGCGA GCCCGGCAAC GCCATCGCCC AGCGCAACCA GGTGCAGTTG GGCGTGACCG GGTTCACCGC CATCCGCATC GACATGTCCG ACAAGATCGC CGGCGTTCTT CCCCTCTATC TCGGCATCAT CATCATCCTT TCCATCCTGA TCCTGATGCT GGTCTTCCGC TCGGTCGTGG TCCCAATCAA GGCCACAGCG GGCTTCCTGC TCAGCATCCT GGCCACCTTC GGTGCCACCA CTGCCGTCTT CCAGTGGGGC TGGCTCAGCG GCCTCTTCGG GTTCGACACC GGCGGCCCGC TGATGAGCTT CATGCCGATC ATCGTGACCG GCATCCTCTA CGGACTCGCC ATGGATTACG AGGTCTTCCT GGTCTCCTCG ATGCGCGAGG CGCACATCCA CGGCCAAGCA GCCCGCCAGA GCGTCGTCCA CGGGTTCGAC CAGGCCAGCC GGGTCGTGGT CGCAGCCGCC ATCATCATGG TCGCAGTGTT CTCCGGCTTC ATCTTCAGCC ACGACATCAT GATCAAGCAG ATCGGCTTCG CCCTCGCCGC CGGCATCCTC ATCGACGCCT TCCTCGTCCG GCTGACCCTC GTCCCGGCGC TCATGGCCGC CTTCGACGAG CGAGCATGGT GGCTGCCCCG CTGGCTCGAC CACCTACTGC CGGACCTCGA CATCGAGGGC GACAAGCTCT TGGCCATGCT CAACCAGCAG GCCGAACCCA CCGACCGAGA AGACAACGAC ATCCGCAGCC GATGA
|
Protein sequence | MSTFLYRLGR TAFGKPWLFV AGWVAVLAVV VGGMAINGVS VSSEMKIEGT EAQTVLDRVA DELPEASGGQ ASVVFTVPDG ERLDTPERLA VISGTVSDVY DLEKVVNPLD AALGAAEQGG PGTPQENAPG DPPAGSDQEP AQEQGPPYQP LLVDGEPVPG VLVSSDGQVA LFQFQFTVAA TSLTDDDVTS VVEVVERAEQ GTGMTVLPSD SLKALEIPIG IGEVIGLAVA ALVLVLTLGS LVAAGLPLIT ALVGVGIGVG GAYALSNVVE MNSATPVLGL MVGLAVGIDY ALFVVNRQRR LILDQGLTAQ EAAGRAVGTA GSAVFFAGLT VLIALTALTV IGIAVLSTMA LVAASTVALA VLIALSLLPA LLGLVGERIC SDKARARRRT KVEAESHGVA DHWVKGVIRF RWPVIAGVVA ILGVMAIPAA SMHLGIPSGA TANQDTAARQ SYEAVSQGFG EGFNGPLLVT AEPVGTSGRV TPELTAKLIG EFQDRGDIVL AAPVGVNEAG DLAVFSIIPA SGPDDEATSD LVKSLREPGN AIAQRNQVQL GVTGFTAIRI DMSDKIAGVL PLYLGIIIIL SILILMLVFR SVVVPIKATA GFLLSILATF GATTAVFQWG WLSGLFGFDT GGPLMSFMPI IVTGILYGLA MDYEVFLVSS MREAHIHGQA ARQSVVHGFD QASRVVVAAA IIMVAVFSGF IFSHDIMIKQ IGFALAAGIL IDAFLVRLTL VPALMAAFDE RAWWLPRWLD HLLPDLDIEG DKLLAMLNQQ AEPTDREDND IRSR
|
| |