Gene Sare_1386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1386 
Symbol 
ID5703745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1600205 
End bp1602490 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content72% 
IMG OID641270896 
ProductMMPL domain-containing protein 
Protein accessionYP_001536277 
Protein GI159037024 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.33888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.278999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGTAACG CCCTGTACTC GTACGGCCGG CTGGCCGCCC GCCGGCCCTG GCAGCTGCTC 
GCCGCCTGGC TGCTGGTGGC CGCGGCCATC GTCGGCGCCT GGTCGGTGGT GGGCACCACC
ATCGACGACG ACGTCCGCAT TCCCGGCAGC GACAGCAACC GCGCCCAGGA GGTCCAGGCT
GCGGTGTTCC CCGCCTCCGC GCTCGGCAAC GGCACCCTGG TGTTCCACAA CGCGGACGGG
TCTGTCACCT CCACCGAGGA TCGGACGGCC ATCGAGGCGT CGCTCCGCGC GGTGCAGGAA
CTGGATGAGG TGACGCAGGT GGTGTCACCG TTTCCGCAGC AGCCCGGCCA GCCCGCGCCG
CGGGTCAGCG CCGACGGGCA CACCGCGTAC GCGCGGGTCT ACTTCGACGT GCCCTCCGCC
GCCCTCGACG AGCAGGCCGC CGACCGCGTG TTGGCCGCCG CCGCTCCGGC CACGGACGCC
GGGCTGGAGG TACTACCCGG TGGCCAGCTC GCCAAGGCGG CGGCCGGCGA TCCCGGGCAC
CGCAGCGAGC TGATCGGCCT GGCGGTGGCT GCCGTCGTCC TGTTGGTAGC CCTCGGCGCG
GCCGCGGCGA TGGCCCTACC GATCATCTCC GCGCTGGTCG GCCTCGTCCT CGGGCTCGCC
GCCATCGGGC TGCTCAGCCA GTTCGGGGCG ATCCCGGACC TCGCGACGAC GGTGGCGAGC
ATGATTTCGC TCGGCGTCGG CATCGACTAC GCGCTGTTCA TCGTCGTGCG CTACCGCGCG
GCCCGGCAGG AAGGCGACTC GCACGAGCGG GCGCTGGGTG TCGCGGTGGC CACCGCAGGC
GCGGCCGTGC TCTTCGCCGG CGCCACCGTC GCCGTCGGCC TGGGCGGCCT GCTGCTGGCC
GGGCTGCCGC TGCTGACCTC GCTGGGGTGG ACCGCCGCCG TGGCGGTCGG GTTGTCGGTG
CTGGCCGCGG TCGGTGTGCT GCCGGCCGTA CTCGGGATCG TCGGCTCGCG ACTCGGTGCC
GGGGCACTGC TGTGGCATCG GTCGAACGCC CCGAAGGCTG GCTGGTGGCG CCGGATCGGC
GAGGGCACGG CCCGGCGGCC CTGGCTGGCG GTGGTCGGTT CACTCATGGT GCTGGCGGTG
TTCATCGCTC CGGTGGCTGG CCTGACGCTT GGGCAGCAGG ACGACGGCCA CGACCCGGCA
GGCACGCCCA CCCGGCAGAG TTACGACCTG CTGGAGTCGG CGTTCGGTGC GGGAGTGAAC
GGACCGCTGC TGGTGGTCGC CGACCTGGGC GACGCCGCTG GGGGCGACCG GGCCGCGATG
CAGCAGCAGG CCCTTGCCGT CAACTCCGCC CTGGCCAGCG TGCCCGGCGT GAGCTCCGTG
CAGGGTCCAC AGGTCTCCGA CGACGGCAGT GCCGCGCTCT GGCAGGTGGT GCCGACGACC
GCGCCCAGCG ATCCGGCCAC CGGTGACCTG GTCACCGAGC TCCGCGAGGA GATCCTGCCA
CCGCTGGCAA CCGACGGTAC GCAGCTGCAC GTCGGCGGCC AGACTGCCGC GAAGATCGAC
TTCACCGATC AGGTGGCCGA CCGGCTGCCG TTGGTCCTGG CGGTGGTGAT CGCGCTGAGC
TTCCTGCTGC TGGTCATCTT GTTCCGATCA GTCGTGATCC CGCTGACCGC CGCGTTGATG
AACCTGCTCT CCGTCGGTGC CGCGTACGGC ATCCTCACCT TCGCCTTCGC CGAGGGGCAC
CTTACGGCGC TGCTCGGACT GGATGGGCCG GTGCCGATCG AGAGCTACAT CCCACTGATC
CTCTTCGCGG TCCTGTTCGG ACTGTCCATG GACTACGAGG TCTTCCTGGT CTCGTCGATC
GCCGAGCGGT GGCGTGCGGA GCGGGACAAC CGGCGTGCGG TGGTGACCGG GCTCGGCTCG
GCGGGGCGGG TCGTCACCGC GGCGGCGCTG ATCATGTTCA GCGTCTTCAT CAGCTTTGCC
GGCCAGGACA ACCCGGTGAT CAAGATGTTC GGGGTGGGGC TCGGGTTGGC GGTGCTGCTC
GACGCGGTGG TCGTCCGCGG GTTCCTGGTG CCGGGGATCA TGGTGCTGCT CGGCCGTGCC
AACTGGTGGT TCCCCCGCTG GCTGGAGCGG ATCATGCCAC GGGTCGATCT GGAGGCTCAC
CCCTCGGCCG GCGAAACTCC CGCTGGGCTC CCGCCGCTCG ACGAAGCGTC CGATGGGCTC
CCGCCGGCTG GTGAAGCGTC CGACGGGTTC CCGCCGGTCG ACGGCCCGGT CCTCGAGACC
AGGTGA
 
Protein sequence
MRNALYSYGR LAARRPWQLL AAWLLVAAAI VGAWSVVGTT IDDDVRIPGS DSNRAQEVQA 
AVFPASALGN GTLVFHNADG SVTSTEDRTA IEASLRAVQE LDEVTQVVSP FPQQPGQPAP
RVSADGHTAY ARVYFDVPSA ALDEQAADRV LAAAAPATDA GLEVLPGGQL AKAAAGDPGH
RSELIGLAVA AVVLLVALGA AAAMALPIIS ALVGLVLGLA AIGLLSQFGA IPDLATTVAS
MISLGVGIDY ALFIVVRYRA ARQEGDSHER ALGVAVATAG AAVLFAGATV AVGLGGLLLA
GLPLLTSLGW TAAVAVGLSV LAAVGVLPAV LGIVGSRLGA GALLWHRSNA PKAGWWRRIG
EGTARRPWLA VVGSLMVLAV FIAPVAGLTL GQQDDGHDPA GTPTRQSYDL LESAFGAGVN
GPLLVVADLG DAAGGDRAAM QQQALAVNSA LASVPGVSSV QGPQVSDDGS AALWQVVPTT
APSDPATGDL VTELREEILP PLATDGTQLH VGGQTAAKID FTDQVADRLP LVLAVVIALS
FLLLVILFRS VVIPLTAALM NLLSVGAAYG ILTFAFAEGH LTALLGLDGP VPIESYIPLI
LFAVLFGLSM DYEVFLVSSI AERWRAERDN RRAVVTGLGS AGRVVTAAAL IMFSVFISFA
GQDNPVIKMF GVGLGLAVLL DAVVVRGFLV PGIMVLLGRA NWWFPRWLER IMPRVDLEAH
PSAGETPAGL PPLDEASDGL PPAGEASDGF PPVDGPVLET R