Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1386 |
Symbol | |
ID | 5703745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1600205 |
End bp | 1602490 |
Gene Length | 2286 bp |
Protein Length | 761 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270896 |
Product | MMPL domain-containing protein |
Protein accession | YP_001536277 |
Protein GI | 159037024 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.33888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.278999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGTAACG CCCTGTACTC GTACGGCCGG CTGGCCGCCC GCCGGCCCTG GCAGCTGCTC GCCGCCTGGC TGCTGGTGGC CGCGGCCATC GTCGGCGCCT GGTCGGTGGT GGGCACCACC ATCGACGACG ACGTCCGCAT TCCCGGCAGC GACAGCAACC GCGCCCAGGA GGTCCAGGCT GCGGTGTTCC CCGCCTCCGC GCTCGGCAAC GGCACCCTGG TGTTCCACAA CGCGGACGGG TCTGTCACCT CCACCGAGGA TCGGACGGCC ATCGAGGCGT CGCTCCGCGC GGTGCAGGAA CTGGATGAGG TGACGCAGGT GGTGTCACCG TTTCCGCAGC AGCCCGGCCA GCCCGCGCCG CGGGTCAGCG CCGACGGGCA CACCGCGTAC GCGCGGGTCT ACTTCGACGT GCCCTCCGCC GCCCTCGACG AGCAGGCCGC CGACCGCGTG TTGGCCGCCG CCGCTCCGGC CACGGACGCC GGGCTGGAGG TACTACCCGG TGGCCAGCTC GCCAAGGCGG CGGCCGGCGA TCCCGGGCAC CGCAGCGAGC TGATCGGCCT GGCGGTGGCT GCCGTCGTCC TGTTGGTAGC CCTCGGCGCG GCCGCGGCGA TGGCCCTACC GATCATCTCC GCGCTGGTCG GCCTCGTCCT CGGGCTCGCC GCCATCGGGC TGCTCAGCCA GTTCGGGGCG ATCCCGGACC TCGCGACGAC GGTGGCGAGC ATGATTTCGC TCGGCGTCGG CATCGACTAC GCGCTGTTCA TCGTCGTGCG CTACCGCGCG GCCCGGCAGG AAGGCGACTC GCACGAGCGG GCGCTGGGTG TCGCGGTGGC CACCGCAGGC GCGGCCGTGC TCTTCGCCGG CGCCACCGTC GCCGTCGGCC TGGGCGGCCT GCTGCTGGCC GGGCTGCCGC TGCTGACCTC GCTGGGGTGG ACCGCCGCCG TGGCGGTCGG GTTGTCGGTG CTGGCCGCGG TCGGTGTGCT GCCGGCCGTA CTCGGGATCG TCGGCTCGCG ACTCGGTGCC GGGGCACTGC TGTGGCATCG GTCGAACGCC CCGAAGGCTG GCTGGTGGCG CCGGATCGGC GAGGGCACGG CCCGGCGGCC CTGGCTGGCG GTGGTCGGTT CACTCATGGT GCTGGCGGTG TTCATCGCTC CGGTGGCTGG CCTGACGCTT GGGCAGCAGG ACGACGGCCA CGACCCGGCA GGCACGCCCA CCCGGCAGAG TTACGACCTG CTGGAGTCGG CGTTCGGTGC GGGAGTGAAC GGACCGCTGC TGGTGGTCGC CGACCTGGGC GACGCCGCTG GGGGCGACCG GGCCGCGATG CAGCAGCAGG CCCTTGCCGT CAACTCCGCC CTGGCCAGCG TGCCCGGCGT GAGCTCCGTG CAGGGTCCAC AGGTCTCCGA CGACGGCAGT GCCGCGCTCT GGCAGGTGGT GCCGACGACC GCGCCCAGCG ATCCGGCCAC CGGTGACCTG GTCACCGAGC TCCGCGAGGA GATCCTGCCA CCGCTGGCAA CCGACGGTAC GCAGCTGCAC GTCGGCGGCC AGACTGCCGC GAAGATCGAC TTCACCGATC AGGTGGCCGA CCGGCTGCCG TTGGTCCTGG CGGTGGTGAT CGCGCTGAGC TTCCTGCTGC TGGTCATCTT GTTCCGATCA GTCGTGATCC CGCTGACCGC CGCGTTGATG AACCTGCTCT CCGTCGGTGC CGCGTACGGC ATCCTCACCT TCGCCTTCGC CGAGGGGCAC CTTACGGCGC TGCTCGGACT GGATGGGCCG GTGCCGATCG AGAGCTACAT CCCACTGATC CTCTTCGCGG TCCTGTTCGG ACTGTCCATG GACTACGAGG TCTTCCTGGT CTCGTCGATC GCCGAGCGGT GGCGTGCGGA GCGGGACAAC CGGCGTGCGG TGGTGACCGG GCTCGGCTCG GCGGGGCGGG TCGTCACCGC GGCGGCGCTG ATCATGTTCA GCGTCTTCAT CAGCTTTGCC GGCCAGGACA ACCCGGTGAT CAAGATGTTC GGGGTGGGGC TCGGGTTGGC GGTGCTGCTC GACGCGGTGG TCGTCCGCGG GTTCCTGGTG CCGGGGATCA TGGTGCTGCT CGGCCGTGCC AACTGGTGGT TCCCCCGCTG GCTGGAGCGG ATCATGCCAC GGGTCGATCT GGAGGCTCAC CCCTCGGCCG GCGAAACTCC CGCTGGGCTC CCGCCGCTCG ACGAAGCGTC CGATGGGCTC CCGCCGGCTG GTGAAGCGTC CGACGGGTTC CCGCCGGTCG ACGGCCCGGT CCTCGAGACC AGGTGA
|
Protein sequence | MRNALYSYGR LAARRPWQLL AAWLLVAAAI VGAWSVVGTT IDDDVRIPGS DSNRAQEVQA AVFPASALGN GTLVFHNADG SVTSTEDRTA IEASLRAVQE LDEVTQVVSP FPQQPGQPAP RVSADGHTAY ARVYFDVPSA ALDEQAADRV LAAAAPATDA GLEVLPGGQL AKAAAGDPGH RSELIGLAVA AVVLLVALGA AAAMALPIIS ALVGLVLGLA AIGLLSQFGA IPDLATTVAS MISLGVGIDY ALFIVVRYRA ARQEGDSHER ALGVAVATAG AAVLFAGATV AVGLGGLLLA GLPLLTSLGW TAAVAVGLSV LAAVGVLPAV LGIVGSRLGA GALLWHRSNA PKAGWWRRIG EGTARRPWLA VVGSLMVLAV FIAPVAGLTL GQQDDGHDPA GTPTRQSYDL LESAFGAGVN GPLLVVADLG DAAGGDRAAM QQQALAVNSA LASVPGVSSV QGPQVSDDGS AALWQVVPTT APSDPATGDL VTELREEILP PLATDGTQLH VGGQTAAKID FTDQVADRLP LVLAVVIALS FLLLVILFRS VVIPLTAALM NLLSVGAAYG ILTFAFAEGH LTALLGLDGP VPIESYIPLI LFAVLFGLSM DYEVFLVSSI AERWRAERDN RRAVVTGLGS AGRVVTAAAL IMFSVFISFA GQDNPVIKMF GVGLGLAVLL DAVVVRGFLV PGIMVLLGRA NWWFPRWLER IMPRVDLEAH PSAGETPAGL PPLDEASDGL PPAGEASDGF PPVDGPVLET R
|
| |